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I.  Introduction 


This  supplement  to  the  annual  report  consists  of  the  following  printed 
table  of  contents  and  a  set  of  microfiche  containing  all  papers  and  theses 
produced  with  JSEP  support  and  published  during  the  period  April  1,  1983 

through  March  31 ,  1984. 

This  form  of  reporting  is  modelled  after  that  introduced  by  the  Stanford 
Electronics  Laboratories  for  the  same  purpose.  The  result  is  a  compact  pre¬ 
sentation  of  a  large  quantity  of  information  tdiich  can  be  produced  much  more 
economically  than  printing.  On  the  other  hand,  it  is  realized  that  microfiche 
is  less  convenient  than  a  printed  document.  Therefore,  those  who  are 
interested  in  particular  reprints  may  contact  R.W.  Schafer  to  request  a  xerox 
copy  of  any  of  the  listed  papers. 

II.  List  of  Reprints 

The  reprints  are  organized  by  work  unit  as  in  the  combined  Annual/Pinal 
Report  on  this  contract.  Numbers  in  parenthesis  indicate  reference  to  fiche 
number  and  page.  The  page  numbers  are  coded  to  the  work  unit  numbers.  Note 
that  fiche  #7  contains  this  printed  index. 

2. 1  TWO-DIMHBIOHAL  SIGNAL  PROCESSING  AND  STORAGE 

NOtl  Constrained  Iterative  Signal  Restoration  Algorithms 
R.M.  Mersereau  and  R.W.  Schafer 

A.G.  Katsaggelos  and  R.W.  Schafer,  "Iterative  Deconvolution  Using 
Several  Different  Distorted  Versions  of  an  Unknown  Signal,”  Proc.  1 983 
Int.  Conf.  on  Acoustics,  Speech,  and  Signal  Processing,  Boston,  pp. 
659-662,  April  1983.  (Fiche  #1,  pp.  1-1  to  1-4.) 

M.H.  Hayes  and  R.W.  Schafer,  "On  the  Bandlimited  Extrapolation  of 
Discrete  Signals,"  Proc.  1983  Int.  Conf.  on  Acoustics,  Speech,  and 
Signal  Processing,  Boston,  pp.  1450-1453,  April  1983.  (Fiche  #1,  pp. 
1-5  to  1-8.) 


WU#2  Spectrum  Analysis  and  Esrametrie  Modelling 
R.W.  Schafer  and  R.M.  Mersereau 

R.M.  Mersereau,  E.W.  Brown,  and  A.  Guessoum,  "Row-Column  Algorithms 
for  the  Evaluation  of  Multidimensional  DFTs  on  Arbitrary  Periodic 
Sampling  Lattices,"  Proc.  IEEE  Int.  Conf.  on  Acoustics,  Speech,  and 
Signal  Processing,  pp.  1264-1267,  Apr.  1983.  (Fiche  #1,  pp.  2-1  to 
2-4.) 


R.M.  Mersereau,  "Dimensionality  Changing  Transformation  with  Non- 
Rectangular  Sampling  Strategies,”  in  Transformations  in  Optics, 
(Rhodes,  Saleh,  Fienup,  eds.)  SPIE  Bellingham,  1983  (invited).  (Fiche 
#1,  pp.  2-5  to  2-9.) 

A.  Guessoum,  "Fast  Algorithms  for  the  Multidimensional  Discrete 
Fourier  Transform,”  Ph.D.  Thesis,  Georgia  Institute  of  Technology, 
March  1984.  (Fiche  11,  pp.  2-10  to  2-90  and  Fiche  12  pp.  2-91  to 
2-170.) 


S. J.  Lin,  "Generalization  of  One-Dimensional  Algorithms  for  the 
Evaluation  of  Multidimensional  Circular  Convolutions  and  DFTs,"  M.S. 
Thesis,  Georgia  Institute  of  Technology,  December  1983.  (Piche  82, 
pp.  2-171  to  2-188  and  Piche  #3,  pp.  2-189  to  2-284.) 

P.  A.  Maragos,  R.M.  Mersereau,  and  R.W.  Schafer,  "Two-Dimensioned 
Linear  Predictive  Analysis  of  Arbitrarily  Shaped  Regions,"  Proc,  IEEE 
Int.  Conf.  on  Acoustics,  Speech,  and  Signal  Processing,  pp.  104-107, 
Apr.  1983.  (Piche  84,  pp.  2-285  to  2-288.) 

Signal  Reconstruction  From  Partial  Phase  and  Magnitude  Information 
M.H.  Bayes 

P.L.  Van  Hove,  M.H.  Hayes,  J.S.  Lim,  and  A.V.  Oppenheim,  "Signal 
Reconstruction  from  Signed  Pourier  Transform  Magnitude,"  IEEE  Trans. 
Acouat.,  Speech,  and  Signal  Processing,  ASSP-31,  pp.  1286-1293,  Oct. 
1983.  (Piche  84,  pp.  3-1  to  3-8.) 

M.H.  Hayes  and  T.F.  Quatieri,  "Recursive  Phase  Retrieval  Using 
Boundary  Conditions,"  J.  Opt.  Soc.  Am.,  Vol.  73,  pp.  1427-1433,  Nov. 
1983.  (Piche  84,  pp.  3-9  to  3-15.) 

M.H.  Hayes,  "The  Representation  of  Signals  in  Terms  of  Spectral 
Amplitude,”  Proc.  1983  Int.  Conf.  on  Acoust.,  Speech,  and  Signal 
Processing,  pp.  1446-1449,  April  1983.  (Piche  84,  pp.  3-16  to  3-19.) 

Multiprocessor  Architectures  for  Digital  Signal  Processing 

T. P.  Barnwell,  HI 

"Optimal  Implementation  of  Plow  Graphs  on  Synchronous  Multi¬ 
processors,"  T.P.  Barnwell,  III,  and  D.A.  Schwartz,  Proceedings  of 
Asilomar  Conference  on  Circuits  and  Systems,  November  1983.  (Piche 
84,  pp.  4-1  to  4-7.) 

Two-Dimensional  Optical  Storage  and  Processing 
T.K.  Gaylord 

Moharam,  M.  G.  and  Gaylord,  T.  K. ,  "Rigorous  Coupled-Wave  Analysis  of 
Grating  Diffraction  —  E  Mode  Polarization  and  Losses,”  Journal  of  the 
Optical  Society  of  America,  vol.  73,  pp.  451-455,  April  1983.  (Piche 
84,  pp.  5-1  to  5-5.) 

Moharam,  M.  G.  and  Gaylord,  T.  K. ,  "Three-Dimensional  Vector  Coupled- 
Wave  Analysis  of  Planar-Grating  Diffraction,"  Journal  of  the  Optical 
Society  of  America,  vol.  73,  pp.  1105-1112,  September  1983.  (Piche 
84,  pp  5-6  to  5-13.) 


Baird,  W.  E. ,  Moharam,  M.  G. ,  and  Gaylord,  T.  K. ,  "Diffraction 
Characteristics  of  Planar  Absorption  Gratings,"  Applied  Physics  B, 
vol.  32,  pp.  15-20,  September  1983.  (Piche  84,  pp.  5-14  to  5-19.) 

Moharam,  M.  G.,  Gaylord,  T.  X.,  Sincerboac,  G.  T.,  Werlich,  H.  and 
Yung,  B. ,  "Diffraction  Characteristics  of  Surface-Relief  Dielectric 
Gratings,"  (Abstract)  Journal  of  the  Optical  Society  of  America,  vol. 
73,  pg.  1941,  December  1983.  (Piche  84,  pp.  5-20.) 


Mo ha r a*,  M.  G.  and  Gaylord/  T.  K.,  "Diffraction  of  Finite  Beams  by  Di¬ 
electric  Gratings/"  (Abstract)  Journal  of  the  Optical  Society  of 
America/  vol.  73,  pg.  1941,  December  1983.  (Fiche  #4,  pp.  5-20.) 

MirBalehi,  M.  M. ,  Guest,  C.  C. ,  and  Gaylord,  T.  K.,  "Optical  Truth- 
Table  Look-Op  Processing  of  Digital  Data,"  (Abstract)  Journal  of  the 
Optical  Society  of  America,  vol.  73,  pg.  1951,  December  1983.  (Fiche 
#4,  pp.  5-21.) 

Baird,  W.  E. ,  Gaylord,  T.  K.,  and  Moharam,  M.  G. ,  "Diffraction 
Efficiencies  of  Transmission  Absorption  Gratings,"  (Abstract)  Journal 
of  the  Optical  Society  of  America,  vol.  73,  pg.  1889,  December  1983. 
(Fiche  #4,  pp.  5-22.) 

Mirsalehi,  M.  M. ,  Guest,  C.  C. ,  and  Gaylord,  T.  K. ,  "Residue  Number 
system  Holographic  Truth-Table  Look-Op  Processing:  Detector  Threshold 
Setting  and  Probability  of  Error  Due  to  Amplitude  and  Phase 
Variations,"  Applied  Optics,  vol.  22,  pp.  3583-3592,  November  15, 
1983.  (Fiche  #4,  pp.  5-23  to  b-32.) 

Guest,  C.C. ,  "Holographic  Optical  Digital  Parallel  Processing,"  Ph.D. 
Thesis,  Georgia  Institute  of  Technology,  November  1983.  (Fiche  #4, 
-pp.  5-33  to  5-68  and  Fiche  #5,  pp.  5-69  to  5-166  and  Fiche  #6,  pp. 
5-167  to  5-184.) 

Hybrid  Optical/Dig ital  Signal  Processing 
W.T.  nodes 

J.N.  Mait  and  N.T.  Rhodes,  "Dependent  and  Independent  Constraints  for 
a  Multiple  Objective  Iterative  Algorithm,"  in  Signal  Recovery  and 
Synthesis  with  Incomplete  Information  and  Partial  Constraints 
(Technical  Digest)  (Optical  Society  of  America,  1983),  pp.  THA14-1 
through  THA14-4.  (Fiche  #6,  pp.  6-1  to  6-4.) 

W.T.  Rhodes,  A.  Tarasevich,  and  N.  Zepkin,  "Complex  Covariance  Matrix 
Inversion  with  a  Resonant  Electro-Optic  Processor,"  in  Two-Dimensional 
Image  and  Signal  Processing,  G.  Morris,  ed.  (Proc.  SPIB,  Vol.  388, 
1983),  pp.  197-204.  (Fiche  #6,  pp.  6-5  to  6-12.) 

W.T.  Rhodes  and  M.  Koizumi,  "Image  Enhancement  by  Partially  Coherent 
Imaging,"  in  Proceedings  of  the  10th  International  Optical  Computing 
Conference  (IEEE  Computer  Society,  1983,  IEEE  Order  No.  83CH1880-4), 
pp.  32-35.  (Fiche  #6,  pp.  6-13  to  6-16.) 


W.T.  Rhodes,  "Hybrid  Time-  and  Space- In teg ration  Method  for  Computer 
Holography,"  in  International  Conference  on  Computer-Generated  Holo¬ 
graphy,  S.  Lee,  ed.  (Proc.  SPIB,  Vol.  437,  1983),  pp.  xx-xx.  (Fiche 
#6,  pp.  6-17  to  6-22.) 

W.T.  Rhodes,  "Acousto-Optic  Algebraic  Processors,"  in  Real-Time  Signal 
Processing  VI,  K.  Bromley,  ed.  (Proc.  SPIE,  Vol.  431,  1983),  pp.  xx- 
xx.  (Fiche  t6,  pp.  6-23  to  6-33.) 

H.J.  Caulfield,  J. A.  Neff,  and  W.T.  Rhodes,  "Optical  Computing:  The 
Coming  Revolution  in  Optical  Signal  Processing,”  Laser  Focus/Electro- 
Optics  Magazine,  November  1983,  pp.  100-110  (invited).  (Fiche  #6,  pp. 
6-34  to  6-42.) 


THEORY  MB)  JUDICATIONS  CT  SUCTRCMAOIETIC  MEASUREMENTS 

Electromagnetic  Measurements  in  the  Time  Data  in 
6.8.  flnith 

G. S.  Smith  and  L.N.  An,  "Loop  Antennas  for  Directive  Transmission  into 
a  Material  Balf  Space,"  Radio  Science,  vol.  18,  no.  5,  pp.  664-674, 
Sept. -Oct.  1983.  (Fiche  #7,  pp.  7-1  to  7-11.) 

H. I.  Bassen  and  G.S.  Smith,  "Electric  Field  Probes  -  A  Review," 
(Invited  Paper),  IEEE  Trans.  Antennas  and  Propagation,  vol.  AP-31,  no. 
5,  pp.  710-718,  Sept.  1983.  (Fiche  #7,  pp.  7-12  to  7-20.) 

G.S.  Smith,  "Directive  Properties  of  Antennas  for  Transmission  into  a 
Material  Half  Space,"  IEEE  Trans.  Antennas  and  Propagation,  vol.  AP- 
32,  no.  3,  pp.  232-246,  March  1984.  (Also  presented  at  the  1983  IEEE 
Antennas  and  Propagation  Society  International  Symposium  and  National 
Radio  Science  Meeting  (ORSI),  Houston,  TX,  pg.  7,  May  1983.)  (Fiche 
#7,  pp.  7-21  to  7-35.) 

G.S.  Smith,  "Limitations  on  the  Size  of  Miniature  Electric  Field 
Probes,"  IEEE  Trans.  Microwave  Theory  and  Techniques,  volume  MIT-32, 
no.  6,  pp.  594-600,  June  1984.  (Fiche  #7,  pp.  7-36  to  7-42.) 

G.S.  Smith,  "Loop  Antennas,"  in  Antenna  Eng ineer ing  Handbook ,  (R.C. 
Johnson  and  H.  Jasik,  Eds.,  New  York:  McGraw-Hill,  pp.  5-1  to  5-24, 
1984.  (Fiche  #7,  pp.  7-43  to  7-67.) 

Automated  Radiation  Measurements  tot  Near  and  Par-Field 

Transformations 

B.B.  Joy 

V.V.  Jory,  E.B.  Joy,  and  W.M.  Leach,  Jr.,  "Current  Jlntenna  Near-Field 
Measurement  Research  at  the  Georgia  Institute  of  Technology," 
Proceedings  of  the  13th  European  Microwave  Conference,  Nurnberg,  West 
Germany,  September  5-8,  1983,  pp.  8-23,  8-28.  Fiche  t7,  pp.  8-1  to 
8-6.) 


E.B.  Joy,  "Spherical  Surface  Near-Field  Measurements,"  Proceedings  of 
the  Antenna  Measurement  Techniques  Association  1983  Meeting, 
Annapolis,  MD,  September  27-29,  1983,  pp.  23-1,  23-8.  (Fiche  #7,  pp. 
8-7  to  8-12.) 


The  last  five  pages  of  Fiche  #7  contain  the  above  list  of 
publications. 


IT'KATItf  uECOhm'lTIOS  USlhw  SEKKAL  DtffEkEhf  tiSfiMTEu  ktUSlUhS 
Of  M  UhhhOdH  SIGhAl 

Aggelos  K.  kattaggelot  and  Ronald  w.  Schafer 


School  of  Electric*)  Engineering 
Georg’*  Institute  of  Technology 
At  lent*.  Georg l«  JOijf 


abstract 

Jn>s  caper  analyses  tne  error  ben a. tor  of 
’t*rat**e  deconvolution  algorithms  when  tne  dis¬ 
tort  t«j  syst en  n*s  *  f resjoency  response  fiat  nas 
’e:at*ve  real  pert  or  has  *  finite  number  of 
•su’ated  eeros.  Tne  e»'stence  of  ti.es-  teros  at 

*  !•«'?«  n„"iOer  of  discrete  freqjenc'ev  results 
>n  an  ^ability  of  tne  aecsnvol ut 'on  *lgO'<tnm  to 
restar-  the  lost  information  at  tn«<~  frequencies 

•  •tn  a  snail  nu«oer  of  Herat ’«,ns.  *  nee 

j'g.rtthm  Is  suggested  that  <ncorpc.'ates  n.tttple 
j’Storted  «'.rs*ons  af  tne  signal  >•  i  results  »«  a 
'estjrat'on  error  mat  app'aicne-  ■•-•ra  w’tn  a 
s-jll  r.u-o*r  of  iterations. 

UhC 0*1  S1KA IHtO  HERAT  Iff  OECOhfO.UTIgh 

!«  general  an  appropriate  *.*tf-.ma;>tai  re¬ 
presentation  far  a  distorting  system  s 

y  •  0  «  l la) 

■ne'e  *  *s  tne  jnano.n  input  Signal,  y  *i  the 
ant,n  output  signal  and  0  <s  a  «no«n  d'startlon 
operator  or  transformation.  A  standard  technique 
far  finding  a  solution  to  Eq.  (1*)  Is  based  upon 
me  iteret'on  equation 

Vl  *  *  »  4  U*'0)«a  Ub) 

■ne'e  I  Is. the  identity  operator  and  <  is  a  co«- 
eerjence  parameter  that  must  be  chosen. 

For  the  cless  of  linear  shift  invariant 
d'Start*ons 

y  •  h  •  i  (?) 

and  i(")  can  be  found  iteratively  using  the 
algorithm, 

«q(")  •  *  h  (-n)  •  y(n) 

•ne'e  •  denetes_convolutien,  *  denotes  complca 
conjugation  and  h(n)  denotes  an  approaiaatlon  to 
tne  impulse  response  of  the  distorting  or  blurr- 
•ng  system  h.  This  elgorithm  is  henceforth  re¬ 
ferred.  to  as  algorithm  *1.  Tne  convolution 
w«tn  n  (-n|  in  the  ebove  algorithm  has  been  In- 
cluoed  In  order  to  ensure  convergence  of  the 

‘tins  work  was  supported  by  the  Joint  Services 
Electronics  Program  under  Cnntrect  fDAAUt-ll-(> 
0 Oil. 

elgorithm  when  the  Fourier  transform  of  h(n)  MS 
a  negative  reel  part-tl). 


U\  ■  >g  frequency  domain  notation,  if  .- : 

<.{.■  represent  tne  fourler  transform  .  •  t-ve 
orig.i^i  and  restored  signal  after  a  it'.  >u' •. 
resp-r. *»vely.  then  It  is  easily  shown  that 

>,  ■  „)  .  .  HLd  tl  -  L1  -  a  «*<„) 

»  (->  *W 

■here  <•)  ana  B(-)  represent  the  fou'i,  .r ,  .. 
form  of  me  impulse  response  of  the  ■  k  i 
olurring  system  and  Its  appr.,’  i  ■  r, 
respectively. 

Using  the  aboue  notation,  tne  spec  • 
tne  restoration  error  of  the  k-tn  Hera'  ■/ 
be  written  as 

from  equation  (4)  we  ouse-ve  that 

imi|.M.|/SW  si 


whenever 

|l  -  *  S^-ll  <  1.  1-1  <  r/T  (7) 

In  this  case  the  Iteration  can  be  gua-riteed  to 
Converge  to  a  unique  solution  ( 1  j .  It  ’•  us'.umed 
that  h(.)  approximates  d(-)  as  close  s-.  possible, 
so  that  tne*r  ratio  approaches  one  so  that 
according  to  equation  (4),  the  err,.-  spectrum 
approaches  it'o. 

At  frequencies  Where  h(.)*U  .  Inequality  (7) 
Is  not  satisfied  (the  operator  (I-  *0)  is  not  a 
Contraction  anymore  Out  It  Is  Simply  none»p*ns*ve 
(TJ).  In  this  case  It  can  be  see"  from  equation 
(«)  that  Hk  (•>)•<>.  at  these  frequencies  and 
according  to  equation  (S),  Tnus. 

with  an  Infinite  number  Of  iteration,  '.nw  con- 
t*nuous  error  spectrum  will  approach  Zero  every¬ 
where  eicept  for  e  finite  number  of  frequencies 
where  H(w|»0  .  At  this  discrete  set  of 
frequencies  the  error  spectrum  Is  equal  to  the 
signal  spectrum.  It  can  be  argued  tnat  because 
the  error  spectrum  differs  from  itro  on  a  set  of 
aero  Masm-e,  perfect  restoration  can  be  achieved 
with  an  Infinite  number  of  Iterations  [i).  In 


I1"-1  -'•V-.-’-l 


practical  'mplementat ions  though,  only  *  f*n*te 
njO*r  of  iterations  can  be  con sidered.  In  this 
CM  to*  err  a'  spectrum  has  the  form  Of  4  tram 
of  pulses  centered  on  the  locations  of  the  aer os 
of  n(.)  t  «ni)  for  the  co*«>n  cist  of  periodic 
pulses  in(-)  has  equally  spaced  aeros)  the 
ret j  1 1  mg  error  in  the  time  domain  hit  4  periodic 
nature. 

Tout,  according  to  the  above  discussion  the 
pitting  of  the  0'*3'r.*t  input  signal  through  the 
Marring  System,  results  *0  absolute  lOSS  Of 
Hi  I>»i!'.h  it  freguencies  -here  .nf-)*0  ,  so  tint 
t-.e  r  >'..ergence  of  tne  unconstrained  iterative 
>:.>h*  )lut*  jn  algorithm  ‘s  slowed  down.  In  order 
t)  sped  ,.p  the  convergence  of  tne  algorithm  this 
lost  ••ifj'-at*oh  -ust  Oe  incorporated  *n  tne 
jlgor *tnm. 


la  *’. loStrate  tne  above  result,  consider  an 
4p5r;,tnat -cn  to  the  impulse  response  of  tne 
J'Stofn;  S/Stem  Of  tne  for* 

n  .  n*u,1 . h- 1 

n>)*-'  (d) 

0  .  other«‘se 

‘  -pulse  -»spanse  (o'  >ls  inaloj  rounterpert) 
>s  i  jS-'u’  '.'lei  f  or  many  physical  distort  ions; 

*;•  •  jn  olyr  *n  'mages.  the 

•.ejuence  'jh)  (H»1j;  .as  convolved  w'tn  an 
■  -O--  tri’h  of  tf e  f pr* 

«.  '.*2'!n-S&)'5:;a-Sv)*;'n-7U)  19) 

f'j.  t : o .  sno»s  the  s;»ctrun  of  t(n)  and  fig. 

l.S,  tie  spectrjh  o<  the  restored  Signal 

alter  loud  'te'lt<0"S  (h(n)*  n(n|  and  •  *1.S). 
tne  eiis’crse  of  tne  per'od'c  leros  is  »ery 
.lea-!/  sVo«n.  tne  resalfng  restoration  t"o' 
v-“.  'ts  scectruh  a-e  j*n  in  fig.  I(Ci  and  1(d), 
respectively. 

A  KU  UCOhvUI  Ul  IUN  ALCOKItHH 

’he  oas'c  *lea  5en'ni  the  proposed  ne* 
iterat‘.e  Cetonvolot *on  algorithm  is  the  incor¬ 
porat'd"  of  tie  lost  information  at  the  specific 
f'eguencies  «nere  ns-i  is  aero,  by  a  second 
ulor-ed  .ers'cn  of  the  same  original  signal. 

According  to  this  algorithm,  the  Ire 
un«no«n  signal  *s  input  to  two  different  distort¬ 
ing  systems.  That  is, 

y,(")*01«(")*n|(n)  •  «(n) 


■  ,,,(h)««ii(n)*i,hl  l-nj*.y,ln)-nj(n|e«t(n) 

♦  i^^'t-njely^tnj-n^tn'-a^tn),  (II) 


where  convolution  w<tn  h,  (-n)  and  n,  (-n)  nas 
been  Included  again  in  order  to  ensure  con¬ 
vergence  of  t  he  algorithm  when  the  fourier 
transforms  of  h.(n)  and  n  (n)  do  nnt  have 
nonnegative  real  1  parts.  ee  'efer  to  inis 
algorithm  as  algorithm  fir. 


In  this  case  using  frequency  dvr.a'o  notation 
•t  can  Be  Shown  that 


v*> 


;l-_:  n1‘iu).«/R^(. )/*■'•  (1C) 


where  H,  (  .  .-..id  •*,(-)  ,  1*1,7  represent  tne 

Fourier  lra> sfc- n  of  tne  impulse  response  of  tne 
blur  mg  system  and  Its  approiinat‘o«, 
respectively. 

from  Eg.  .(17)  ‘t  *S  easily  seen  that 
when  n.{.)  and  rt  (.)  have  no  connun  aeros.  the 
term  »fcs‘de  the  Vacaets  can  always  be  made  less 
than  one,  with  appropriate  choice  of  tne 
parameters  •,  and  .  and  consequently  it 
approaches  rtro  for  all  frequencies  when  raised 
to  tne  .  t*t)  power  for  a  large  number  of 
iterations.  Conceptually  this  means  that  tne 
in'jrnat'on  of  the  original  spectrum  »(«)  that  ‘S 
lost  at  me  frequencies  where  «.(„)  Is  aero,  is 
provided  to  the  algorithm  by  h‘(. )  ,  and  vice 
versa,  tops,  me  algorithm  described  by  Ed.  (H) 
converges  to  a  unique  solution  «ucn  faster  than 
the  algo'imn  eescrioed  by  Eq.  (4),  and  tne  spec¬ 
trum  of  '.he  restored  signal  »t(*)  Is  smoother, 
the  restoration  error  mat  results  w*m  me 
application  of  the  algorithm  *2  has  a  form  that 
*S  tne  superposition  of  the  errors  that  wou'fl 
nave  resulted  n.l«l  a«d  «,».)  had  been  applied 
separately,  but .approaches  aero  much  faster.  In 
the  case  that  ",(.)  and  «.,(*)  have  one  o'  more 
aeros  In  common,*  the  spectrum  of  the  restored 
signal  *,(*)  evhibits  a  larger  error  at  the  fre¬ 
quencies  of  the  con-on  aeros  but  aga*n  algorithm 
it  performs^  much  Or. ter  than  algorithm  *1  with 
the  use  of  Hj(u)  or  Mj(w)  . 


y?'n)*0/.(n). h^(n)  •  «<n)  (10) 

where  yj(n)  and  yp{n)  are  known  output  signals 
and  h,(nj  and  h^fnf,  tne  impulse  responses  of  me 
two^  distorting  systems  wnio  are  approiimated 
by  i'll")  and  «,(")  respectively.  Then  the 

original  s'gnal  «(n)  can  be  recovered  from  yj(n) 
and  y^(n)  by  us’ng  the  algorithm 

.  lalu  k.'luill.  («i  *  »_S  * 


the  choice  of  the  parameters  t,  a«d  »,  ‘s 
determined  by  the  requirement  that  tne  operator 
(I-  *iDi-  i;0;)  must  be  a  contraction  in  order 
for  the* unconstrained  algorithm  to  converge  to  a 
unique  solution.  Tne  operator  (I-  iilj-  ** 

defined  by  an  equation  similar  to  tq.  (lo)  w’tn 
the  use  of  tq.  (10).  It  <s  easily  shown  mat 
this  implies  that 

.  a  2,  ,  .  a 


(*n)«y?(n) 


l> 


l 


If  hj(r.)  and  S^(a)  ft  normalized  to  tint 

S,|l»*S;(i)}*i  St(")*.  K2(«j«l  (14) 


inequality  (13)  leqilles  tint  i,  and  i,  must  lie 
Inti  at  a  triangle  tmt  It  formed  by 
tne  *j  and  a»<s  and  the  line 

to  illustrate  t nt  effectiveness  of  the  «*• 
ilgu'iinm.  consider  appo* ‘net lout  to  the  impulse 
responses  of  tne  distorting  systems  of  the  for* 
described  by  Eg.  (8),  »nere  toy  ii.(n)  M*I6  and 
;-\r  n,(»)  y.*i.  tne  same  sequence  descrloed 
by  liL.  .f,  't  used  m  tut  evample.  Hue  to  the 
facts  tn»;.  a)  the  it'O  of  «,(-)  for  -*T*/d  It 
very  close  to  the  zero  of  n,(.)  for  -«d«/9  , 
.o*ie  tne  otner  zeros  ft  qo'te  f «r  apert,  and  b) 
tne  values  of  tne  speccra  of  the  oturred 
signals  f,i.'  and  »,{. )  ft  dote  to  zero  for 
f'i’O vei. 'it  jroun j  . ■  >• : s  ,  tne  effect  of  tmt 
"|t;rjst*  soman  tero  on  tne  spectrum  of  tne  r*. 
st  >»«d  signal  <t  tnoun  ‘n  fig.  2(<J  far 

WJ  „  'terat'jnt,  and  n.(n)»hj(n), 

h;in;*h,(n)  .  It  *t  ver}  clear  tnat  tne  spectrum 
'h  F>g.  2(a)  ‘t  much  vnootne'  and  dote  to 
tie  jrij.njt  one  (F'g.  t(a)l,  tnan  the  one  snoun 
•n  fig.  l,a)  for  tne  taie  number  of  Iterations. 

Corresponding  rettorat  *on  "error  that  it  IhOnh 
•h  fig.  2(o;.  *t  tt*ll  periodic  and  *t  can  be 
-.“eh  to  St  much  smaller  than  the  error  snonh  ‘n 

F’g.  i;o. 

Tne  mean  squared  rettorat*on  error  (USE) 
tetui'en  the  o'lg’njl  t'gnal  •  ( n ) .  and  the  rettor- 
e a  t’gnal  «k(n),  hat  been  cnoten  at  a  criterion 
fo'  co*?a'*hg  the  effect ‘ renett  of  the  t»o  decon- 
oi.t'on  alji'ithnt  ana'ytea  *n  tmt  paper.  Tne 
tinj’at’on  r.dultt  tnat  nare  been  obtained  for 
pom  tne  decoruol  ution  algorithms.  ut'ng 
different  lengtn  blurring  funct'ont.  are  ihonn  in 
fig.  3.  Cac"  curve  In  tmt  figure  repretentt  the 
IS;  *n  a  logarithmic  tcale  at  a  function  of  the 
nunoer  of  Herat  iont.  Tne  curvet  labeled  A  i"J  8 
•ere  '.stained  by  application  of_the  algorithm 
•  I.  In  both  cases  »»l.b  .  h(n)*h(n)  ,  and  the 
blurring  function  nas  described  by  eq.  (8).  For 
curve  A,  if*  12,  and  for  curve  8,  N«9.  Tne  curves 
labeled  C  and  D  ne'e  obtained  by  application  of 
the  algorithm  ti.  In^  both  cases  8  , 

«j(n)*  n,(n)  ,  hj(n)*  hp(n)  ,  the  1  blurring 
»  functions  »ere  described  by  Eq.  (8),  and 
for  n,(n)  if  «as  equal  to  18.  The  parameter  ft 
(eq.  8)  for  the  olurrlng  function  h.(n)  .as  equal 
to  9  for  curee  C  end  equal  to  12  for  curve  0. 

The  much  better  result  represented  by  the 
curve  labeled  0  Is  due  to  the  fact  that  the  fre¬ 
quency  responses  of  the  tuo  blurring 
systems  H,(.)  a»d  n^u)  have  teros  close  to  each 
otner  at  ion  frequencies  and  zeros  far  apart  from 
each  other  at  high  frequencies  uhere  the  anpl I- 
tude  of  the  spectra  of  the  blurred  slgnels  f<(  •) 
•no  Ty(  *)  are  small.  Conceptually  this  means 
that  the  term  inside  the  brackets  in  eq.  |I2)  is 


considerably  less  than  one  for  all  frequencies, 
because  for  the  one  undesirable  case  of  close 
teros  (Inn  frequencies)  the  spectra 
of  K,(.)  and  3,(.)  have  e  large  amplitude  untie 
for  the  other  ‘undesirable  case  of  small  spectral 
amplitude  (high  frequencies)  the  zeros 
of  fi,(u)  and  3,(e)  ft  far  apart  Iron  each 
other. Similar  results  have  been  obtained  min  tne 
use  of  image  data. 

DISCUSSION  ANO  CONCLUSIONS 

The  Improvement  thet  H  achieved  by  the  ne. 
algorithm  «Hh  respect  to  tne  mean-squa'ed 
restoret-on  error.  Is  on  the  order  of  oB  for 
2uuO  iterations,  as  It  c'.n  be  seen  >n  F»g.  (a), 
unite  the  computational  effort  H  sHgntly 
bigger.  Another  Important  feature  of  tne  ne. 
algorithm  H  that  it  can  be  applied  i«  cases  «nen 
tne  inverse  filter  does  not  eaist.  Also  note 
that  the  algorithm  described  by  Eq.  (II)  is 
different  from  the  results  from  the  algorithm 
described  by  _  Eq.  _  (3),  if  «e 
replace  n(n)  «Un  s.h.ln).* ,n.(n)  and  h(n) 

■  »th  *.h  (n).u  h  (n)  f  Even*  though  the  Fourier 
transform  of  Tne  linear  cpmbinat <on  of  impulse 
responses  mill  _not  be  zero  at  tne  zeros  oi 
rimer  n,(.)  or  S  t -)  ,  tnere  is  no  guarantee 
tnat  i«-*,h,f.)  «HI  not  have  zeros  at  other 

local >4ni  on  (if  un*t  circle.  Even  H  all  tne 
resulting  zeros  are  off  the  unH  circle,  the 
System  may  not  be  minimum  phase  ana  therefore  ‘t 
a’ 1 1  not  be  possible  to  o»te‘n  a  stable  inverse 
filter. 

The  cese  »n  «nUh  constraints  hare  been 
Incorporated  *n  the  «ea  algor‘tnm,  can  be 
analysed  as  descriped  »n  the  paper  p,  Schafer, 
at  al  ,1J.  Obviously  in  this  case  since  the 
operator  (|-  *,D|»»  0,  )  it  a  contraction,  the 
constraint  operator  need  only  be  nonerpanamg 
in  order  for  tneir  product  to  be  a  contraction. 
Some  Initial  results  are  snonn  in  fig.  (4).  Tne 
Incorporation  of  a  posHIvt'y  constraint 
results  *n  a  much  smaller  NS  uHh  the  use  of 
both  algorithms,  but  still  tne  algorithm  ti  per¬ 
forms  better,  as  can  be  see*  by  comparing  the 
Figures  (4)  and  (4). 

Further  research  ulll  involve  an  optimum 
choice  of  the  ver'able  parameters  ».  ana  *  ,  in 
order  to  speed  up  to  convergence  of  tne 
algorithm;  frequency  domain  constraints;  the 
effect  of  random  additive  no  He  in  the  inputs  ano 
application  of  algorithm  a2  to  the  shift-varying 

czse. 
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F.3  1.  Miuiv*t  on  o<  the  per  ckJ'C  na:u  e  of  the  rotOfil'On  hro»  lal  Sji«tfum  of  t?  »  wquwce  *fnl  of  Eq  (91. 
ib'  Si'ect'um  o*  the  restored  113^*1  eft  e*  1000  ite#et'Ons  &nd  >*18  (cl  Reiio»at'0«  cfroc.  Note  that  the  error 
at  tr*e  ;pcat>on  of  the  imp u'wt  fd(  Spectrum  of  the  sequence  in  F  iqure  V*cl. 
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F 19.  2.  The  cite  that  the  frequency  responses  of  the  d*stort*n«t  systems  have  one  'almost*  common  aero, 
la)  Spectrum  of  the  restored  s-ytal  after  1000  iteration*  and  *,  Ibl  Restoration  error. 
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AISTRACT 

The  extrapolation  of  a  band  United  signal 
frou  observat ions  taken  over  a  finite  Interval  is 
an  important  problem  In  applications  such  as 
spectral  estimation  and  Image  processing.  The 
eatrapolation  of  bendlimited  analog  signals  is 
fundamentally  different,  however,  from  the  band- 
limited  extrapolation  of  discrete  time  sequenc¬ 
es.  Specifically,  as  has  been  previously  noted, 
a  unique  solution  to  the  discrete  problem  does 
not  exist.  In  this  paper,  this  fact  is  demon¬ 
strated  in  a  new  and  convincing  way.  1"  particu¬ 
lar,  two  methods  are  presented  for  constructing  a 
sequence  ahtc".  In  the  frequency  domain,  is  band- 
limited  to  an  arbitrary  cutoff  frequency  and 
wn':h.  fn  the  Ctme  domain.  Is  equal  to  zero  over 
f.  interval  of  arbitrary  length.  The  importance 
of  the  existence  of  these  multiple  solutions  to 
the  general  extrapolation  problem  is  enpnas wed 
and  questions  are  raised  regarding  the  need  for 
additional  constraints  in  the  discrete  band- 
limited  extrapolation  problem. 

INTRODUCTION 

Considerable  Interest  has  recently  been 
focussed  upon  the  problem  of  computing  values  of 
a  band  limited  signal  given  knowledge  of  the 
Signal  over  a  finite  time  interval.  This  band- 
limited  eatrapolation  problem  has  been  studied  by 
Pepoulls  [1],  Jain  and  Ranganath  [2]  and  many 
Others  too  numerous  to  mention  here,  Hhen  formu¬ 
lated  in  terms  of  continuous-time  bendlimited 
Signals,  a  unique  solution  has  been  shown  to 
eilst,  and  an  algorithm  has  ieen  given  for 
obtaining  that  solution  [1],  However,  most 
Implementations  of  the  algorithm  are  discrete; 
t.e.  they  seek  to  compute  samples  of  the  band- 
limited  signal  from  a  finite  set  of  its  samples. 

Although  it  Is  not  surprising  that  a  band- 
limited  signal  cannot  be  uniquely  determined  by  a 
finite  set  of  its  samples,  there  Is  still  consid¬ 
erable  Interest  In  the  discrete  band! lotted  ex¬ 
trapolation  problem.  In  this  paper  we  discuss  a 
property  of  sampled  bendlimited  signals  which 
sheds  light  on  the  uniqueness  problem  in  discrete 
bendlimited  extrapolat Ion. 


*  This  work  was  supported  by  the  Joint  Services 
Electronics  Program  under  contract  P0AAG29-81-X- 
0024. 


MCKtOOMB 

Consider  a  sequence.  x(n)  •  xa(nT),  obtained 
by  sampling  a  bandltmited  analog  signal  xa(t). 
If  the  smnpling  rate  is  hgh  enough,  the  discrete¬ 
time  Fourier  transform  of  x(n)  will  have  the 
property 

*  )  x(n >e‘j“"  *  0  «e  «  w  «  <  .  (1) 

By  analogy.  x(n)  can  be  termed  a  frequency  band- 
limited  sequence.  In  implementing  discrete  band- 
limited  extrapolation,  It  is  necessary  that  the 
sampling  rate  be  high  enough  to  result  In  a 
bandltmited  sequence  of  samples  so  that  the 
Dandtimiting  constraint  can  be  applied  using  * 
discrete  low-pass  filter. 

.  For  continuous-time  signals,  the  time-domain 
property  that  is  analogous  to  bandllmitedness  is 
time  limitedness.  Due  to  the  dual  nature  of  the 
direct  and  inverse  Fourier  transform  relations  in 
the  continuous  time  case,  properties  of  band- 
limited  and  ttmeltmtted  signals  also  stand  in  a 
dual  relationship  to  one  another.  For  example, 
if  the  signal  is  frequency  bandltmited.  the  cor¬ 
responding  time  funct.on  is  an  analytic  function, 
and  if  the  time  function  Is  time  limited,  then 
the  corresponding  Fourier  transform  is  an  analyt¬ 
ic  function.  It  is  this  property  in  fact  which 
suggests  the  possibility  of  extrapolation  i.i 
either  the  time-domain  [1.2]  for  frequency  band- 
limited  signals,  or  in  the  frequency  domain  [3] 
for  time-limited  signals. 

As  a  direct  consequence  of  the  .wielyttctty 
properties  discussed  above,  we  have 

Property  Cl:  A  conti pious- time  signal  cannot  be 
both  time- limited  and  frequency 
bandltmited. 

or.  more  generally. 

Property  C2:  A  continuous-time  signal  and  Its 
Fourier  transform  cannot  both  be 
identically  tero  over  intervals  of 
any  length  (t). 

For  discrete  signals,  there  is  an  important 
difference.  The  time-domain  representation  is  a 
sequence  rather  than  a  function  of  a  continuous 
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variable.  See lust  of  this,  tht  dull  relationship 
between  the  tine- domain  end  frequent/  dooilns 
cinnot  be  guaranteed. 

ramriES  of  imolimted  sequences 

An  fades- Halted  sequence  is  defined  to  be 
such  thit 

i(a)  •  0  for  n  <  n^  end  fl  >  nft.  (2) 

cieirly  we  tin  tike  n  •  1  end  iw  •  N  without 
lost  ©f  generality.  if  x(n)  Is  indei-1 inited. 
then  its  discrete-t lor  Fourier  trmsfom  is  easl- 
1/  seen  to  be  I  finite  trigonometric  polynomial 
Of  order  b-1,  ind  therefore  it  eiy  hive  it  most 
N-1  isolated  leros  in  the  intervil  |w|  «  •.  Thus 
we  hive  analogously  to  Property  Cl. 

Property  01:  A  discrete-tine  signal  cinnot  be 
both  inaea- 1  lotted  end  frequency 
binol lolled. 

However,  there  Is  no  analogous  property  to  Pro¬ 
perty  C2.  Indeed  we  cin  state  thit 

Property  02:  It  Is  alnays  possible  to  find  I 
sequence  which  is  identically  zero 
over  an  arbitrary  finite  Interval 
and  whlcn  is  bmdllnlted  to  my 
desired  frequency  band. 

In  order  to  verify  Property  02,  we  will  give  a 
constructive  proof.  Specif ically.  we  will  show 
that  a  bandlinlted  sequence,  g{n).  nay  always  be 
constructed  in  such  a  way  tnit  CieJ''j  *  0 

for  W  <  |w|  «  V  ana  g(n)  -  0  for  1  «  n  «  N 
where1  w£  and  h  are  arbitrary. 

Kith  f  (t)  an  arbitrary  analog  signal  which 
Is  band  United  to  [-w>£,w£J  ,  define  f4(q)  as 


f,<q)  ■  j“C  F^-le'^d-  (3) 

-w 


vwicre  q  •  w*Jt  Is  a  comp  lev  variable,  hote  that 
f*(t)  •  ?4{-Jt)  .  I.e..  f4(t)  equals  f4(q) 

along  the  Imaginary  ails.  In  addition,  note 

thit  f4(q)  is  the  Laplace  transfora  of  the  func¬ 
tion  f  (u)  .  Thus,  since  Ft(«l  has  finite  sup¬ 
port,  f4(q)  is  an  entire  function  of  eiponential 
type.  Therefore,  by  the  Hacanard  factorization 
theoren  [S],  f4(q)  nay  be  written  In  an  Infinite 
product  expansion*. 

f  .(q)  •  A  qV«  *  (1  -  | )  e^"*  (4) 

k«l  Tl 


white  the  q. 't  denote  the  (non-zerol  zero-, 
of  f.(q)  ,  A  Ts  the  order  of  the  zero  of  f4(q)  at 
q  •  0,  and  where  A  end  a  denote  coat  lex  con¬ 
stants.  The  following  Is  a  property  of  entire 
functions  of  eiponential  type. IS] 

Property  A:  let  f.(t)  be  a  bandlinlted  signal 
whose  Fourier  transfora  vanishes 
outside  the  interval  l-w  ,  w.j.  and 
let  f4(q)  be  the  analytic  continua¬ 
tion  of  f.(t)  into  the  coupler  pune 
as  defined  In  (3).  Then  the  drle- 
tlog  of  a  finite  nmoer  of  z  *ros 
of  f.(q)  or  the  replacement  of  a 
finite  set  of  N  zeros  of  f4(q)  with 
another  set  of  h  zeros  yields 
mother  bandlinlted  signal  qa(t) 
whose  Fourier  transfora  vanishes 
outside  the  interval  [*“e.  “CJ  . 

As  f  result  of  this  property,  consider  the 
slgn|l  yq)  which  is  obtained  by  novtng  h  zeros 


of  f 


•J1 


as  follows: 


94(p)  *  FtU) 


A 

I 

k-1 


.  ft(q)  h4(q)  (S) 


Tnus,  the  effect  of  h({q)  is  to  nove  the  h 
zeros,  ioj.o? . Oq)  of  f.lo)  to  »  points  equal¬ 
ly  spaced  along  the  Imaginary  ails.  I.e..  q  •  J. 
2},  ....  Nj.  Therefore,  the  sequence  g(")  ob¬ 
tained  by  adapting  g4<tj  with  a  adapting  period 
of  T  •  1,  l.e„ 


9(»)  *  94(b)  *  94  (-1") 


is  equal  to  zero  for  1  «  n  «  n 
the  Fourier  transfera  of  g(n)  Is 


«) 

Furtheraore. 


6(ej“) 


kL 


(((v  ♦  ivk) 


O) 


Therefore,  since  (.(v)  vanishes  outside  the  In¬ 
terval  ,J-u  ,  w  )  end  w  <  v  ,  It  follows 
that  6(*J  )  1  if  bandl  tailed  to  [-w  .  «  ]  and 
g(n)  Is  the  sequence  idilcn  was  to  be  constricted. 

Pole  that  although  Property  C2  Is  stated  in 
terns  of  a  bandlinlted  sequence  being  zero  over 
an  arbitrary  Interval,  It  aay  in  fact  be  shown 
that  a  bandlinlted  signal  aay  always  be  found 
which  is  zero  over  an  arbitrary  finite  set  of 
fndtdes,  i.e.,  n.  for  k  •  1,  2,  ....  X.  1" 
particular,  all  that  is  required  Is  to  re¬ 
place  h4(q)  In  (S)  with 


Ajld) 


!  c-^j 

k.i 1  «-,»k 1 


(8) 


In  the  proof  tf  Property  C2  above,  a  proce¬ 
dure  was  outlined  for  constructing  bandlinlted 
sequences  which  are  zero  over  arbitrary  finite 
length  Intervals.  There  exists,  however,  an 
taster  and  auch  acre  efficient  procedure  for 
generating  these  sequences. 


Specifically,  l«t  f(n)  be  an  arbitrary  bind* 
liatM  sequence  whose  Fourier  trMtf«n  It  ttre 
outside  the  interval  [x  ,  w  ].  Consider  tM 
sequence  g(n)  defined  by 

H 

g(ii)  .  f(n)  •/(«-»)  (9) 

for  !«t  constants  \  tor  k*1.2 . .  A.  Cl  ear¬ 

ly,  since  g(n)  may  be  obtained  from  f(n)  with  an 
MR  fitter  whose  tyttea  function  It  given  by: 

11  a 

Mi)  •  »  -  l  *ki  (10) 

»•!  * 

then  g(n)  aiutt  alto  be  a  bandllnlted  sequence 
atilcb  hat  a  Fourier  transform  equal  to  zero  out- 
tide  the  Interval  (-■»-.  “,]  •  boa  tuppote  ae 
taipote  the  constraint1  Chat  g(n)  •  0 

for  1  «  n  «  R  in  (9).  Ihlt  leads  to  a  ttt  of  N 

equations  In  the  h  unknowns  a,  for  k*l . R; 

i.e., 

h 

f(")  •  l  a.  f(n-k)  n*!.2.....  A  (11) 

k*l  * 

Therefore,  assuming  that  a  solution  to  (11) 
Kists.  a  sequence  aitn  the  desired  properties 
may  be  constructed.  The  question  arises,  hew- 
ever,  at  to  me  eaittence  of  a  solution  to  (11) 
for  a  given  N  and  f(n)  or,  more  importantly,  hoa 
to  select  a  f(n)  tucn  that  a  solution  to  (11) 
ealsts  for  any  h.  To  address  this  question. 
Suppose  that  f(n)  it  a  bandllnlted  sequence  alth 
a  real  and  non-negative  Fourier  transform,  i.e., 

HeJ")  »  0  for  -a  «  a  <  a  (1?) 

In  this  cate,  f(n)  corresponds  to  a  valid  auto¬ 
correlation  function  and,  as  a  result,  equations 
(11)  represent  the  familiar  normal  equations  of 
linear  prediction  tneory  which  nay  be  solved  by 
the  Levinson  or  Durbin  recursions^].  Thut,  (11) 
will  have  a  unique  solution  for  any  N  except  for 
the  case  In  which  f(n)  is  of  the  form: 

M 

f(")  •  I  fkcos(w  n  a  q  )  (13) 

k«l  '  * 

For  sequences  having  this  form.  (11)  has  a  unique 
solution  only  for  those  values  of  h  for  which  h  < 
«  10. 

As  an  example  which  Illustrates  this  proce¬ 
dure,  shown  in  Figure  1  Is  the  bandllnlted 
sequence 

sin  w,n 

'(«)  *  Tth  <»«> 

e 

tdiere  w  •  0.2*.  Shown  In  Figure  2  is  the  band- 
limited  sequence  g(n)  with  a  gap  of  A«25  zeros 
which  was  generated  from  f(n).  Figure  3 
shows  A (eJ“),  the  frequency  response  of  the 

filter  used  to  obtain  g(n)  from  f(n).  Finally, 
Figure  4  shows  the  Fourier  transforms  of  f(n)  and 
9(n). 


0ISCUSS1M 

Rate  that  Property  02  has  some  important 
implications  for  the  discreth  band! lotted  extra- 
potation  problem.  Clearly  it  Is  not  possible  to 
uniquely  extrapolate  a  bandllnlted  sequence  from 
a  set  of.  R  consecutive  samples  of  a  bandllnlted 
sequence  since  there  are  an  Infinite  number  of 
sequences  which  are  identically  zero  over  the 
same  interval  and  banal tmited  to  the  tame  or 
lower  cutoff  frequency.  Obviously  any  one  of 
these  sequences  could  be  added  to  the  original 
signal  without  changing  the  R  samples  available 
to  us  anq  without  violating  the  banal  imitated 
constraint.  Indeed,  the  constructive  proofs 
given  above  can  alto  be  viewed  as  eatrapolatton 
methods,  since  we  can  create  a  sequence  with  a 
gap  of  R  zero  samples  and  insert  the  A  known 
samples  from  the  original  bandllnlted  sequence  to 
obtain  an  infinite  sequence  Milch  Is  banolimlteo 
to  the  desired  cutoff  frequency,  and  which 
matches  the  original  sequence  In  the  inter¬ 
val  1  c  n  <  R. 

Thus  Property  D2  Implies  that  there  It  no 
unique  solution  to  the  discrete  bandllnlted  e>- 
trapolatlon  problem.  Although  this  seems  to  be  a 
well  known  result,  we  have  not  seen  It  proved  in 
either  of  the  above  ways.  One  approach  which  has 
been  proposed  to  obtain  a  unique  solution  Is  to 
apply  adottlonal  constraints  such  as  minimizing 
the  energy  of  the  eatrapolatton. (2)  Perhaps  the 
constructive  methods  for  creating  eatrapolatlons 
of  band-limited  sequences  will  be  useful  in 
choosing  additional  constraints  to  apply  In  the 
eatrapolatton  process. 
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abstract  ■  •  ,*fc  $v  w> 

kecent  mu' I  by  md  Speike  (1,2]  Such  I  Uft  Is  derived  M  discussed  '«  [J.,. 

Ms  snawi  tut  «uU  Idiment'Ohil  discrete  Fourier 
fmsfo'rs  iJMs)  cn  be  defined  for  tignilt 

defined  un  n/  periodic  tempting  littice  I'd  till  HAtRik  COUlMlKlf  AtCORIIM 

tie/  cii  be  e»«ljited  using  *  generilljilion  of 

tie  Coo'ij-lwiejr  FI  I  ilgorunm.  tne  mill  pjrpote  In  tMs  section  we  *111  Outline  e  »*tn» 

i-f  tut  »j»»  ait  to  develop  iltemitive  Pgo-  -jeieritwitidn  of  tie  Coolejr-Ii.se/  fist  Fou'ter 
nti-t  .n«cn  mt't  nj't  Suitible  to  mghly  oirel-  trms'ora  (Fit)  ilgo'lth*  '«]  un*c«  cm  be  used 

'id'ne  e'cMtectuees  md  which  required  less  to  eviljite  |l).  A  i»re  complete  discussion  is 

ditl  Midi  mg  tnm  tie  CvO  Vy-tjkey  llgorttnms.  given  In  (2). 

S.'n  m  ilgorlth*  >s  described  n«re.  It  «»«es 

use  of  He  Vitn  no*»*l  for*  rejreseit it io«  of  in  The  key  to  tne  efficient/  of  tne  9ene'ill<ed 

uteje'  eitri*.  As  d  t’del'gnt  to  tnis  *ori  e  Coole/-tu*e/  ilgo'itn*  is  tne  fectoriblHt/  of 

Jnmese  '*»e'nj*r  tneore*  for  litticet  nit  been  tne  periodicity  «itri«,  I.  It  Is  aril  mown  tnit 

3»velrp»d  eucn  permit  m  e* tension  of  wood's  tne  efficiency  of  i  •  ffl  ilgoritm  depends 
^r>*e  f  let  or  iljarttns.  Tnis  is  i’so  described.  strongly  upon  tne  lc"gtn  o*  tne  trmsfO'T,  h; 

tnese  ilgontnms  become  truly  efficient  on  1  /  when 
A  Is  I  nlgnly  composite  Integer.  S'vnUrly, 
IHTHUDuCflOH  efficient  Cooley-tuiey  ilgorithms  for  the  multi* 

dinensionil  problem  eilst  *nene»er  the  per*od»cl- 
*n's  bluer  Iddresses  the  proble*  Of  evllult-  ty  *ltrl«,  H,  Is  I  composite  Integer  eitrl*.  If 

|"9  I  sen«»i1  nultid'*>ensiCi«il  discrete  Fourier  R  is  composite,  ft  cm  Ue  written  is 
trms'or*  uft)  Of  tn*  fora 

.  .  H-ffl  (i) 

HA)..  .(a)  e.p.-jk'tf-H'V-  <>> 

•  1_  where  f  md  Q  me  Integer  aitnces  such 

"  tnit  Idet  t  1*2  md  Idet  Q  |»2.  (As  m  is*de  *t 

tie  sequence  «'.n)  md  *ts  OM.  «(k).  ire  issued  cm  De  noted  tnit  H  is  fictorible  whe*e*er  the 

to  0*  •(-dieensloniF.  InuS,  a  md  k,  the  signil  ibsolute  vllje  of  Its  deteralnmt,  which  «ust  be 

dMI-ln  I  id  Fourier  doaiin  independent  v|rt«plet  m  Integer,  Is  not  one  or  I  prlae  nuober.  Such  I 

ire  K-dieensionil  column  vectors  aim  Integer  fictorlcition  is  not  unique,  encept  possibly  In 

coefficients.  The  non-fero  simples  of  «(a)  ire  the  one-dloenslpnit  else), 

confine J  to  the  region  lg  In  the  signil  doaiin. 

The  *ltrl«  ■  IS  mown  is  the  periodicity  ni-  The  Sumrit Ion  In  (l>  produces  I  distinct 

t r I* .  It  Is  m  M  >1  nitric  with  integer  elements  vllue  for  |det  Hi  different  vilues  of  k  md  it  Is 

wnose  ro’e  In  the  agltl-dlaensloml  UFT  Is  Invertible  If  the  region  lw  ilso  contilns  Idet  Hi 

milogous  to  tne  trmsfora  length  of  i  simples.  These  Idet  H|  viiuts  of  n  md  k  cm  be 

one-dlmenslonil  llgorltha.  For  the  tridltlonil  eipressed  IS 

Vfl,  which  relites  i  rectmgulir  1/  stapled  signil 

to  'ectmgulir  simples  of  its  Fourier  trmsfora,  k*Qa*l*Rr  (4i) 

H  is  dUgonil,  but  non-dllgonil  periodicity  ae- 

trices  cm  occur  In  computing  the  OFT  of  I  signil  *  •  f  p  p  I  I  (kb) 

which  Is  not  rectmgulirly  stapled,  for  enaple, 

t  two-d'aenstonil  uft  whicn  relites  e  heilgonilly  where  p  end  a  cooe  froa  sets  of  Integer  vectors 

s«ipled  signil  to  Hengonil  simples  of  tts  contitnlng  |det  f I  members  md  q  md  1  co*e  from 

Fourier  trmsform  uses  tne  periodicity  aitrli  sett  of  integer  vectors  contilning  Idet  Ql  mem¬ 

bers.  Substituting  eqt.  (4)  Into  (I)  reduce*  tne 


•This  work  wes  supported.  In  pirt,  by  the  Nittontl  Science  Founditton  under  grint  lCS-7#tT2ut  md  by  the 
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dK^itltlwi  of  I  attrll-l  Off  into  IM  Coaputl- 
t ion  of  10*1  P|  aitrti-Q  Of  Tt  pint  Idet.QI  ai- 
trw-P  ufls  pint  |d*t  ■(  iddilloni'  coaplei 
■nit  'pi  tcit'ont.  In*  ft* l  (OipiitilloKl  effort  It 
lett  mm  if  (1)  'I  eviluitrd  directly.  tf 
eitner  P  or  Q  «t  tonpotUt,  t  tlallir  decomposi¬ 
tion  c«n  oe  tited  to  eviluite  to*  vutler  Of  ft. 

To*  two  mutt  Common  ilgortthat  for  evllult- 
ing  me  nglt  Idtaentlonil  rectingulir  Oft,  tne 
r;..co1j*  ilgorltnm  md  tit*  vector-ridli  ilgo¬ 
rltnm,  correspond  to  tpeCtll  Cites  Of  this  llgO- 
nvnm.  Tnelr  specific  relittonsntp  to  tn* 
jene'd  1  •Igoritnri  tt  dttcutted  In  [i). 

•nt!«  tn*  generiMcrd  Cooley-tvkey  llgorltha 
it  e’egmt  mom  i  cuncept-il  point  of  view,  It  It 
difficult  to  Imptenent  for  non-dtigonil  periodi¬ 
city  netricet.  In*  difficulty  Het  witn  tn* 
vector  wqjtnlent  of  tn*  blt-reuertil  operi- 
t ion.  A  resolution  of  tint  difficulty  It  known 
for  tne  ne»«]on*l  cite  ISj.  but  tn*  resulting 
jljO'itn.T  reguir*c  ;nit  the  aiti  be  r*inoe»ed  it 
»i:n  oeci-itioi  ttig*  in  tn*  ilgorttna.  Tne 
jljoritn.n  Jvr.rioeo  m  tn*  ne,t  section  reduces 
tie  i mount  of  dill  snuffling  required. 


SMITH  HOKUM.  TORN 

•n*1  ■  It  dllgoul,  1  mjl  t  Id  'rent  lonil  JfT 
cm  be  efficiently  Implemented  uting  (ltn*r  tne 
rowtjlurn  ilgorttM  or  tne  >ector-rid«i  llgo- 
cltr.n.  |f  H  it  non-dllgonil,  w*  cm  develop 
1  ml  I ir  ilgoritnmt  if  we  flrit  writ*  it  In  Smith 
nornil  form 

•  ■HI  (S) 

«n*r*  o  is  m  integer  ougonii  eitm  md  H  md  1 
t r*  uniTjOj'ir,  1.*  ,  |d*t  U.Mdet  »i*l  md  II  md 
»  me  Integer  ritncet.  Tn>t  dero-iposlt Ion  cm 
be  performed  by  eiecutmg  element  my  row  md 
column  opr' It  Ions  on  H. 

Substituting  eg.  (S)  Into  (I),  tn*  OfT  turn- 
r*t»on  cm  b*  written  it 

!;»)•  „  1(H)  *ipiO*V,.?rp‘,/*-,fli.  (•) 


How,  If  w*  define 

.  .  u"*» 

; . 

tn*  JfT  tj-nrilton  reduces  to 

»(»)  •  t  «(«)  (T) 


Tnlt  turn  repr«t«ntt  «  milrli-Q  Off.  furtn«raor« 
tine*  H  tt  uninodjlir  ft  .md  n  Off  In*  tn*  ton* 
•ltt*cf •  Tn*  sequence  i(a)  1\  v'apty  t  rtlndtied 
version  of  ■(*).  Slalltrly  i(k)  It  *  rfindeaed 
rertlon  of  «(*).  Thtt  decomposition  provides  tne 


following  ilgorttna  for  eviluliting  «  aitrtu-H 
OfT: 

Algorithm  A: 

1.  (iprett  I  m  SaitH  norail  fora  it 
H  •  U  0  V. 

2.  Scrmole  the  Inpyt  mriy  KCO'Omg 
to  tn*  relit <on  a  •  r'l. 

S.  Compote  1  uff  of  tn*  resulting 
irriy  using  i  nttrti-D  ilfl.  Since 
0  It  dltgonil,  tntt  cm  oe  Oone 
using  *ltner  1  row-coluwi  0*1  or  t 
vector-ridli  ffT  ilgorltha. 

A.  Untcrimble  tne  output  sequence 
iccording  ..  to  tne  r*li- 
t  Ion  lift. 

Observe  tnit  wltn  tnlt  tlgorltnm  tne  aultiotmen- 
t'cnil  irriyt  need  to  be  relndered  tt  most  twice, 
once  it  tne  beginning  ot  tne  ilgorttna  mo  once 
•t  tne  end. 

while  tne  eitrtr  P  It  in  H  i  M  Integer  at- 
frit,  the  mitria.O  lift  it  step  i  of  tne  llgg-ltna 
it  not  nerettmily  m  N-di»entlo«il  OfT.  to 
it  lost  rite  tils  ftet  consider  *  t>o-d*me«siontl 
nit'll  N  IjfT  fur  «ni;n  |det  K!*H]S^.  for  tone  H 
tne  ait' II  0  Will  ttsume  tne  fora 


for  other  H  tie  mt'li  0  *111  ittear  tne  Cora 


«1  though  tn*  Smith  no'Bil  fora  for  t  ailm  It 
not  unique,  the  fora  lor  Itt  diigoml  substitute 
noratlly  it.  (On*  exception  to  tnlt  tllte-nent  It 
discussed  below). 

If  •  nit  tne  fer*  of  eq.  (d),  tn*  off  of 
ttep  J  Of  Algorithm  A  corresponds  to  m  h,  ■ 
two-dlmentlonil  rectmgulir  UfT.  If  D  nit  tne 
fora  of  eq.  (y),  tmt  vfl  it  m  h|  ■  Sj-pomt 
one-diaentioml  In  I .  Since  tne  two-dtaentto**1 
trmtfora  cm  be  computed  ao'e  efficiently  tnm  i 
on*-di*entton«1  trmtfora  wltn  the  time  numoer  of 
points  m  I  mitrii  whose  dugontl  substitute  It 
pf  tn*  f«ra  o*  eg.  Ill  It  to  be  preferred  over 
on*  of  the  fo-a  of  eg.  (*).  If  hj  mu  me 

relitlvely  prla*  the*  d>tgo«i1  egylvilentt  of 
either  fqra  evltt.  Tnlt  fict  wit  eiploited  by 
Good  [t]  whose  prla*  *«lor  Ilgorttna  represents 
m  efficient  Ilgorttna  for  evllultlnf  •  1  D 

Off.  Tn*  prla*  fictor  ilgorttna  works  by  mitlng 
•  1-0  UfT  it  1  ?-0  uff  Whose  periodicity  aitrl. 
ft  of  tng  fora  of  (9).  Tn*  dltl  me  then  n*r*wt- 
ed  Into  i  fom*  where  th*  periodicity  aitru  mt 
tn*  fora  pf  eg.  (a)  which  it  men  *»i  tteg  using 
I  row-coluan  two-dlaenilontl  UfT,  with  m  itten- 
dint  coaputlt  10*1 1  livings. 


In  a  similar  fashion,  «h«n  A  is 
M.di*e»stona1.  tne  dimensionality  o f  the  mat'ia-0 
J*  I  nay  <i'j|  from  1  to  N. 


tw  ae  i'»  ready  to  present  #  Chinese  re¬ 
mainder  thtorn  tor  integer  vectors.  Suppose 
that  ■  ts  0  composite  integer  Mtri>  Such  tnet 


WTHII  P* IK  FACTOR  ALGORITHM 

nitn  Algorithm  A,  the  evaluation  of  en 
m-d mens io«*l  natria-A  oft  can  oe  accomplished  by 
•tens  of  «  rectangular  OFT  of  dimension*! ity  tess 
to  in  jr  egi.il  to  ft.  It  ctn  ilso  oe  accomplished 
Sy  js<nq  •  higher  dimensional  rectangular  OFT  by 
us'ng  i  generalization  cf  Good's  prime  fiCtor 
ilgjritnm  [ej.  To  eaplaln  tne  algorithm.  ho«- 
uer.  »e  nil  I  need  some  results  (ran  letttce 
tnoery. 


ill  0;.  o, .  4*  be  H  Imeirly  independent 

*ect:'i  in  t%e  m-dimensional  rt*i  Euclidean 
spare.  lie  set  of  sectors 

i  •  ujij  ♦  ...  *  (Hi) 

•  'tn  integral  jt,...,  uu  >s  called  tne  latt ice 

•  lth  oasis  A;...*. ,  a*.  if  tne  sectors  a, . 

ft  combined  into  a  -*t'i«,  A.  eg.  (Id)  can  be 
•ntten  as 

a  •  A  u.  (II) 


“  ’  ri«l  *  Vd 

■diere  ’det  RjMdet  Ryl*p,  |det  UjMdet  0,1*0. 
and  p  and  q  are  relatively  prime.  Then  lj/(iT  is 
ispmorphic  to  t|yp;  *  1|/qT-  Thus  any  integer 
sector  k  from  tne  "region*  L  can  be  repre¬ 
sented  by  tne  sector  pair  (hj.  s mere 

•  k  mod  R^ 
k;  •  k  mod  q{ 

Tne  Inverse  mapping  is  given  by 


d«) 


k  •  ,A  kj  ♦  •  ky,  mod  « 


Rk, 
•  k. 


fcj  906  Pj 
^  -od  t|{  . 


.rt  tne  lattice  generatea  oy  tne  matria  A  oe 
■vouted  «.g.  Tnere  is  a  one-to-rjnj,  relationship 
te:«ren  lattices  ana  matrices,  to  each  nonsingj. 
lar  ratrta  a  corre sponds  a  lattice  La.  but  to 
each  lattice  ig  the**  is  a  •hole  class  of  nonsin- 
gul ar  eat' ices.  T«o  matrices  A  and  •  belong  to 
tne  same  class  If  A  •  ■  U  mne'e  0  is  a  unimoduta' 
c atria. 


A  second  isomorphism  Is  given  by 

h  •  ,«j.  »ij 

■here  Q^ISj  ♦  Rjis,  .  ■  mod  N. 

Substituting  the  t«o  inverse  relations  Into  eq. 
(I),  tne  lift  summation  can  be  ■rltten. 


!f  a  lattice  l>  is  contained  in  a  lattice 
i*.  tnen  :■  is  called  a  viblstt  ice  of  lg.  In 
this  case  *  •  A  C  one'e  C  is  an  integer  mat'll, 
ine  set  of  vectors  common  to  to  lattices  lg  ano 
lq  constitute  a  lattice  ln  called  tne  qreatest 
v.ipin  Suulattice  of  lg  ang  Lg. 


*1^11  r  i  *1*1*  My.  * 

V  ll/R,  V  ll/Q  ? 


*■»’*!  Pl\  -J*.kj  Oj’  My 


(«» 


If  *  and  ■  are  to  vectors  belonging  to  a 
lattice  la.  a«d  if  ig  is  a  sublattice  of  lg,  o 
■’ll  say  l"*t  n  is  Congruent  to  ■  modulo  I,  ur  it- 
ten 

M  i  •  (modulo  •)  (12) 

If  (*-•)  ts  a  vector  belonging  to  this  rela¬ 
tion  defines  a  set  of  equivalence  classes,  called 
tne  set  of  residues  modulo  R,  »here  a  class  (•) 
is 

>]•;■  >  lg  such  that  ■  i  *  (modulo  ■))  (13) 
this  set  of  classes  Is  denoted  tgyg. 


Tne  resulting  algorithm  Is  similar  to  Algorithm  A 
In  that  it  involves  shuffling  the  data,  pe'furm- 
ing  a  lift  and  tnen  shuffling  the  result.  Tne  GFT 
formula  in  (12)  reduces  the  computation  to  a 
number  of  smaller  WCs.  A  oatrii-l)  GFT  Is  eval¬ 
uated  for  each  value  of  tne  Indea  *\  and  then  a 
mitria-R  luF I  is  evaluated  for  each  value  of  the 
Indea  ky.  The  nuvbf'  of  complea  multiplications 
Is  then 

M  •  I det  R|mj  e  |det  Qlm^,  (13) 

uhcre  mj  and  my  are  the  number  of  multiplications 
for  *  mat'll. F  OFT  and  a  matrla-Q  respectively. 


untie  eq.  (12)  indicates  the  required  compu¬ 
tations,  it  is  not  clear  that  an  efficient  order- 


KfUtKES 


♦nj  for  the  doti  ce*  be  found.  Thet  UU  t»  mode 
eti'w  if  «  ttenoo  d  enh  for  «ocn  of  tho  let* 
tl<n  ond  MilUIlm  It  uted.  elth  no  loti  of 
jnwiliiji  itt  ut  confine  oertelvet  to  tne  t*o- 
dtmentionol  cote  end  let  ut  contider  tne  evoluo- 
t (o«  of  o  2- U  Mlrli-f  Iff  of  tne  form 

4<»>  •  l  *(•)•»>;- JJukV1*,’  (14) 

*  Ll/P 

let  P  •  {pj.p^J.  Then  It  con  be  thoan  thet  there 
eoltt  vectors  ij  ond  tuch  thot 

‘1  *  hll»l 
*2  ■  •'zjPi  *  *u*2 
n„  »  0  .  nw  >  h^  ,  0 

ond  »j  tnd  i,  for*  .  bot<t  for  Ip.  Inverting 
tnete  time  ions  givet 

Pt  «  9U«, 

*t  *  «2i*i  *  h2\ 

’ne"  tne  tet  of  vectors 

ul*l  4  *2*2 

for  *«t »ilu«  of  uj  *nd  t*ie  r$*qt 

0  t  ut  <  9ji 

0  ‘  J2  <  *22 

constitute  o  repretentot l*e  t/stem  of  retidut 
clottet  for  t,,_  .  S'«ilor|j  mere  e«ttt  vectors 
»i.  ond  integer*  l...  tuch  thtt  tne  tet  of 

vector*  11  «* 

Vl  *  Y2»2 


constitute  o  repretentot  <*e  tet  of  residue 
clottet  for  It  •  ,hut  If  «(«)  •  "(uj.uj)  ond 
t(k)  •  Mvj.v^J,  the  OFT  becomet 
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Dimensionality-changing  transformations  with 
nonrectangular  sampling  strategies 


RimmA  M.  MwMTMtt  Abstract  Thi»  paper  ts  concerned  with  tha  uaa  ot  dimensionatity- 

Geo'oia  Institute  cl  Technology  changing  Uentformetiont  (or  tha  digital  processing  of  signals  that  have 

School  of  Electrical  Engineering  been  sampled  on  sampling  lattices  other  than  the  lamiliar  rectangular. 

Atlanta  Gaoroia  30332  01  row-column  one.  Alter  introducing  the  idea  ot  nonrectangular  sam¬ 

pling.  the  paper  formally  presents  a  particularly  useful  class  ol 
dimensionality-changing  transformations  and  presents  conditions  under 
which  they  can  be  used  for  signal  processing.  It  does  this  by  means  of  a 
vector  notation.  Tne  major  result  Is  that  the  use  ol  sucli  transformations 
with  nonracianguiafly  sampled  data  is  no  It:  ■  restrictive,  no  more  dif¬ 
ficult.  nor  substanti*"y  different  than  with  rec'anguiarly  sampled  date 


f.  OVERVIEW 

A  topic  of  increasing  intern!  in  the  dif  iial  signal  processing  com¬ 
munity  concern*  the  representation  and  processing  of  multtdunen- 
sional  signals,  such  as  images  and  electromagnetic  field  distribu¬ 
tions.  on  periodic  but  nonrectangular  sampling  lattices.  These 
representations  are  important  for  digital  signal  processing  because 
they  can  mean  a  reduced  sampling  density,  uhich,  in  turn,  means 
reduced  storage  and  reduced  computation;  they  would  appear  to 
hasc  some  adsantages  foe  spatially  discrete  optical  processing  as 
well.  One  of  these  alternatisc  sampling  strategies,  in  fact  the  most 
common  one.  uses  a  hexagonal  sampling  lattice  foe  two- 
dimensional  12-Dl  signals. 

Reseesibk.  linear,  dimensionality-changing  transformations  per¬ 
mit  both  optical  and  digital  signal  procc.sinf  operations  to  be  per¬ 
formed  using  intermediate  signets  whose  dimensionality  may  be 
different  from  both  the  original  and  ultimate  signals.  These 
transformations  are  well  understood  in  the  context  of  rectangularly 
sampled  signals'  and  they  can  also  be  used  t*rii  nsNirecrangutir 
representations.  It  is  the  use  of  dimensionality-changing  transfor¬ 
mations  with  nonrertangular  sampled  representations  that  forms 
the  central  topic  of  this  paper. 

This  paper  is  divided  'nto  three  parts.  In  the  first  part  a  general 
framework  for  the  nonrectangular  sampling  and  discrete  processing 
of  bandlimited,  spatially  continuous  signals  is  presented.  In  this 
presentation  a  vector-matrix  notation,  which  has  been  found  to  be 
particularly  useful,  is  used.  It  allows  the  basic  concepts  to  stand  out 
tot  an  uncluttered  fashion  and  makes  generaliulion  obvious.  In  the 
second  pan  of  the  paper  dimensionality-reducing  transformations 
for  rectangularly  sampled  signals  are  reviewed  using  this  notation. 
Attention  is  directed  toward  the  performance  of  signal  processing 
operations  using  these  transformed  signals,  in  the  final  section  of 
the  paper,  the  results  of  the  first  two  tarts  are  brought  together  in  a 
consideration  of  dtmensionalily-clii':ig  transfotmationi  for 
nonrcctangularly  sampled  signals. 

2.  NONRECTANGULAR  SAMPLING 

If  fcfx,y|  denotes  a  spatial):  cr  itinuous,  two-dimensional  signal, 

ihe  operation  of  reeling urtr  sampling  can  be  described  by 


f(n,.nj)  -  fJn,X,  n,Y) . 


(I) 


f(fx.y|  It  bandlimited  such  that  its  Fourier  transform,  F(tu,v). 
satisfies 


Fjiu.vi  w  o  .  u  a  :*!  G> 

then  fc|x.y)  can  be  exactly  recovered  from  the  array  of  sample 
values  given  in  Eq.  II). 

We  can  define  ike  Fount*  tnm/arm  of  the  sequence  fin rn f. 
which  we  shall  denote  as  Ffu.v).  according  to  the  formula 


Ffu.v)*  J  2  f(n,.nj)exp|-j2vn,Xu -j2tn,VsJ .  (}) 

n,  n, 

Ffu.sl  is  periodic  in  both  u  and  v  with  a  period  of  l/X  in  u  and 
period  |/V  in  v.  Such  a  function  is  said  to  be  rwrangvIeWr 
periodic. 

The  Fourier  transforms  of  the  continuous  and  discrete  signals  arc 
related  by 


F(u.v) 


I 

XV 


2  2 


|4) 


When  Ms.yl  is  bandlimited  and  sampled  with  sampling  rates  in  ex- 
cess  ol  •>.:  Nyquisi  rates.  Ftu.v)  and  Fc(u.v)are  proportional  to  one 
another  and  ate  given  by 


Ffu.v) 


I 

XV 


Fflu.v) , 


In! 


< 


I 

2X 


M 


(51 


Notations lly.  all  of  these  expressions  can  be  simplified  if  we 
adopt  a  sector  notation  for  the  signals.  By  defining  x  »  h.yl',  u 
■  fu.v)’ ,  etc.  |<  )'  denotes  a  sector  transpose!.  Eqs.  (II.  (I).  and 
|4|  can  be  written  as 


fin)  -  f^n) 
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where  X  and  V  are  the  horizontal  and  vertical  sampling  intervals.  If 


%  - 


r  *  * 


F<*)-  £ 

* 


<n 


.iu)  - — ! — jF^.-ar'Ti.  <•> 

i*iX!  f 

Tlir  m.urix  J<  is  known  u  the  sampling  matrix.  Fo«  the  rectangular 
sampling  bt'it  defined  ia  £4.  (I),  it  b  pvei 1  by 


X- 


O) 


Although  ihc  exact  (arm  for  these  expressions  hat  heen  developed 
only  with  tome  foresight,  (hr  contittency  of  £41.  (6)  to  (ft)  with 
Eqt.  (I).  (3).  and  (4)  should  be  evident.  It  thou  Id  abo  be  clear  .hai 
fc'qs.  (61  to  (HI  can  also  be  uted  to  detenbe  M-dimemional  rect- 
angular  sampling.  In  Ihit  cate  A  becomet  an  M  x  M  diagonal 
matrix  and  n.  u,  and  IT  become  M -element  column  vectort. 

Periodic  nonrectangular  sampling  can  alto  be  dct.ribed  by  Eqt. 
(6)  to  (l|.  The  only  difference  it  that  X  11  no  longer  diagonal.  In 
fact  the  columns  of  X  are  vectort  whose  integer  linear  combina¬ 
tions  define  the  sampling  lattice.  At  an  example  ttc  have  the 
nonrectangular  lattice  shown  in  Fig.  1(a),  which  corresponds  to  the 
sampling  matrix 


00) 


It  should  be  noted  that  although  this  lattice  is  nonrectangular  it  it 
nonetheless  regular  anti  periodic.  In  Fig.  1(b)  we  thow  the  Fourier 
transform  of  the  sequence  defined  by  the  sampling  lattice  in  Ftg. 
1(a).  F(u )  is  periodic  in  u;  that  is. 


F%.  2.  A  hexagonal  sampling  lattice. 


F(u)  »  F(u  ♦  UY) 


for  any  integer  vector  T.  where 


U  -  (X  )  '  - 


(H) 


(»> 


U  is  called  the  pthodtcitr  matrix  of  the  periodic  signal.  If  f  (7)  is 
band'imited._such  that  its  Fourier  transform  is  confined  to  one 
period  of  F(  u ),  then  f^T)  can  be  recovered  exactly  from  f(  n ).  It 
ia  interesting  to  note  that  the  sampling  density  is  given  by 


IdetJil 


I 

ItMAI 


which  is  also  equal  to  the  area  of  one  period  of  F(  u ). 
Hexagonal  sampling  corresponds  to  the  sampling  matrix 

r*  *1 


£- 


(!»> 


Y  -Y 


for  which  the  sampling  lattice  n  drawn  in  Fig.  2.  If  X  -  (2/s/3)Y. 
each  sample  of  the  hexagonal  lattice  is  equidistant  from  six 
neighboring  samples.  Hexagonal  sampling  i:  optimal  for 
bandlimited,  spatially  continuous  signals  w  hose  Fourier  transforms 
are  confined  to  an  ellipse.  (This  ellipse  becomes  a  circle  when 
X  « (2/v'3)Y.)  By  this  we  mean  that  of  all  the  sampling  lattices  that 
permit  an  exact  reconstruction  of  Ihc  spatially  continuous  signal, 
the  hexagonal  one  has  the  minimum  sampling  density.  This  result 
was  shown  by  Petersen  i.nd  Middleton.2 

Nonrectangular  sampled  signals  can  be  used  for  signal  processing 
as  well  as  for  signal  .epresentation.  often  with  significant  computa¬ 
tional  savings.  A  number  of  signal  processing  algorithms  for  the 
hexagonal  case  have  been  derived  by  Mersrreiu.*  Specifically, 
algorithms  have  been  devt^ped  for  tincar,  shift-invariant  filtering, 
discrete  Fourier  transform  calculation,  frequency  response  evalua¬ 
tion,  and  filter  design.  For  isotropically  bandlimited  2-D  signals, 
ncxagonal  represents  jr  can  mean  storage  savings  of  13%  and 
computational  savings  of  25  to  (0%  over  the  comparable  rect¬ 
angular  representations. 

3.  DIMENSIONALITY -CHANGING 
TRANSFORMATIONS  FOR  RECTANGULARLY 
SAMPLED  SIGNALS 

A  dimenstonahljnehanting  tratuformttkm  h  a  rearrangement  uf 
the  samples  of  a  finite  extent  R -dimensional  anry  onto  an 
S-dimensioiul  array  (S  *  K)  that  contains  the  same  number  of 
samples.  The  motivation  for  such  transformations  is  .imply  one  of 
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commoner  ii  may  be  tiutf  lo  process  an  R -dimensional  signal  by 
Fust  mapping  the  signal  10  an  S-dimensional  lattice,  processing  the 
resulting  S-dimensional  signal,  and  then  mapping  the  result  back  to 
an  R -dimensional  format.  If  R>S,  such  a  transformation  is  called 
dimtiwvitahtr-redurin/;  if  R  <  S,  it  is  called  duntnuunahiy- 
incmtmi.  Clearly,  the  inverse  of  a  dimensionality-reducing 
transformation  (DRT)  is  a  dimensionality-increasing  transforma¬ 
tion  (OITI.  A  simple  e sample  of  a  DRT  is  the  lexicographic  order¬ 
ing  of  the  elements  of  a  finite  area  2-D  array.4  This  corresponds  to 
concatenating  the  rows  for  columns)  of  the  2-0  sampling  lattice  to 
form  a  long  l-D  sequence. 

For  a  dimensionality-changing  transformation  to  be  useful  for 
signal  processing  it  should  satisfy  at  least  two  conditions.  First, 
operations  such  as  linear  Filtering  and  Fourier  transformation  of 
the  S-dimensional  signal  should  correspond  to  meaningful  opera¬ 
tions  on  the  R-dimensional  signal.  Secondly,  the  transformation 
must  hr  uniquel)  inscrtihlc  for  arrays  with  a  Finite  number  of 
samples.  The  latter  requirement  is  necessary  if  we  are  escr  to  he 
able  to  return  from  an  S -dimensional  to  an  R-dimensional  formal. 

A  particularly  useful  family  of  dimensionality-changing  transfor¬ 
mations  for  signal  processing  are  URT's  for  which  the  sector  in¬ 
dices  of  the  R-dimen\ional_  arrays  are  mapped  linearly  lo 
S-dimcnsional  indices.  If  fK(  n  I  denotes  the  R-dimcnsional  array 
and  gs(  n  )  denotes  the  S-dimensional  atray,  then  such  a  mapping 
can  be  written  as 

gs(Tn)  *■  **<">.  (14) 

w here  _T_  is  an  S  «■  R  matrix. 

l  et  the  region  of  the  R-dimensional  sampling  lattice  w  here  f„(  n  I 
is  non/ero^bc  denoted  by  f .  The  transformation  J_  w ill  he  ins crnhlc 
if  m  =J_n  is  unique  for  every  sector  n  in  I.  In  what  is  ts>  follow, 
we  shall  assume  thal  T_  represents  an  insertible  transformation, 
although  inis  inverse  operation  cannot  genctally  be  dcss'ritscd  as  a 
matrix  opcianon.  fit  can.  howrscr,  be  implemented  using  a  look¬ 
up  table! 

As  an  example  consider  the  rowwise  lexicographic  ordering 
discussed  earlier  for  the  case  R  *  2,  S  «  I .  Thai  is,  we  wish  to  form  a 

1- D  sequence  by  concatenating  the  tows  of  a  2-D  sequence.  Let  the 

2- D  sequence  occupy  an  N,  >  N.poinl  sampling  lattice.  I  his  map¬ 
ping  is  defined  by  the  transformation  matrix 


defined  at 


Ftf  u )  ■  2  f,(n  )expf-j2»  u'  n|  (t»| 

n 


Gs(v)  •  J  gs(  m)exp|-j2»s'  m|.  (20) 

m 

(dlls  assumes  thal  the  sampling  matrix  is  the  identity  matrix)  whete 
u  is  an  R-dimensional  frequency  variable  and  v  is  an 
S-dimcnsional  frequency  variable,  wt  can  write 

Gs(s  >  =  J  gs(I  b  I  exp|-j2»  v  T_n  | 
n 


-  £  fn*">cM»H2»v£n| 

n 

»  F*ll  *" )  •  121) 


The  I  ouner  transform  of  the  sequence  gvl  m  )  thus  corresponds  to 
the  evaluation  of  the  Fourier  transform  of  the  sequence  l„(  n  )  on 
an  S-dmu-i.sional  subspace  of  the  R -dimensional  Fourier  space. 

I  or  a  columnwise  lexicographic  ordering  x  ix  a  scalar  and 


or 

F  .fN.s,  s).  <22l 


1 1  \l  (15) 

Thus,  il  follows  that 

m  -Jfn  »  1 1  N,|  f  n,  1  =  n,  »  S,r_s  (16) 


and 

g,(n,  N,n.)  *  fcfn,,n,) .  (17) 

Note  that  in  this  example,  xince  S  -1 ,  the  index  m  reduces  to  a 
scalar.  In  this  case  an  operator  T  '(m|  that  will  invert  the  transfor¬ 
mation  is  given  by 

T  '(m)  *  ln,,n;) 

n>  -  «m)).  \ 

«">»»•., - - - J.  (I*» 

where  t(m))N|  represents  the  evaluation  of  m  with  respect  to  the 
modulus  N|. 

If  the  Fourier  transforms  of  the  sequences  f„(  n )  and  gs<  m )  are 


This  says  that  the  Tourier  transform  of  the  sequence  g,lm|  is  equal 
to  the  Founei  transform  of  the  sequence  M  n )  evaluated  along  a 
single  tine  in  the  2-D  founer  plane.  Because  of  the  periodicity  of 
Fd  u  ).  however,  this  is  equivalent  to  the  evaluation  of  one  period 
of  F  ■(  u  >  on  the  series  of  parallel  lines  shown  in  Fig.  J. 

To  demonstrate  how  the  transformed  signals  can  be  used  for 
linear  filleting,  consider  the  configuration  depicted  in  Fig.  4,  where 
an  S-dimensional  linear,  shiti-invanani  system  is  placed  between  a 
reversible  DRT  and  its  inverse.  If  hst  m )  denotes  the  impulse 
response  (point  spread  function)  of  the  S-dimcnsional  system,  then 

gs(  m  )  =  gs(  m )  *  hs(  m ) ,  (2» 

where  *  denotes  an  S-dimensional  convolution.  In  the  Fourier 
domain 

Gsfv)  -  Gs(7)Hs(f);  (24) 

or 

Fa»r  »>  -  Fa<I  7»H*H  T).  (25) 

w  here  H„(  iu  ( is  the  frequency  response  of  an  R -dimensional  system 
whose  impulse  response  maps  lo  hs(  rh  )  under  the  transformation 
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Fig.  1.  Th«  lines  In  th«  2  0  Fourier  plan*  that  contain  tha  t  o 
Fouriar  Iranatorm  ol  a  columnwise  lexicographically  ordered  2  0 
saquanca. 


Fig.  4.  An  implementation  ol  an  Rxltmensionil  linaar.  ahili- invariant 
system  by  maana  ol  tha  realization  ol  an  S-dimensional  ona. 


T\  The  transitions  from  Eq.  (231  to  Eq.  (241  and  from  Eq.  (24|  to 
fcq.  (25)  arc  both  inscrtihlc. 

If 

F„(u)  »  F,(ir)HR(u).  (2*1 

then 

fR(n)  ’  (27) 

inhere  here  •  denotes  an  R-dimensional  convolution  If  Eq.jl2(>)  is 
true,  then  the  R -dimensional  consolution  of  fK<  n  )  with  hR<  n  )  can 
be  performed  by  performing  the  S-dimenstonal  consolution  of 
fjl  m )  with  hs<  m  >.  Equation  (25)  says  that  this  result  is  true  on  a 
subspace  of  the  Fourier  plane.  The  question  is:  when  are  we 
guaranteed  that  it  is  true  in  the  whole  R-dimensional  Fourier  space? 
This  is  simply  the  Fourier  domain  statement  of  the  requirement 
(hat  the  DRT  described  byj.be  insertible  on  the  lattices  containing 
gs(  m  ).  gs(  m ),  and  hs( in ).  (Note:  the  insernbility  of  the  transfor¬ 
mation  operation  does  not  depend  upon  the  inscrtibility  of  the 
matrix  X:  If  R  *  S,  the  matrix  T.  will  never  be  insertible.) 

Since  the  R -dimensional  and  S-dimensional  convolutions  become 
equivalent  if  the  DRT  is  invertible,  we  see  that  we  could  just  as  well 
perform  an  S-dimensional  convolution  by  performing  an 
R-dimensional  one.  Thus,  the  system  in  Fig.  4  could  be  used  if 
S  >  R  by  simply  interchanging  |b*  DRT  and  its  inverse. 

It  should  be  emphasized  _thal  the  fact  that  gs(m)  is  invertible 
does  not  guarantee  that  gs<  m  )  will  be  invertible  also.  Consider  as 
an  example  the  convolution  of  two  5  x  5  arrays  by  means  of  the  I  -D 
rowwise  lexicographic  ordering._The  true  convolution  of  the  two 
arrays  is  a  9  x  9  point  array.  f„(  n ).  If  we  were  to  use  the  transfor¬ 
mation  matrix 


X  -  II  S| 

however,  we  would  see  that  while  this  is  invertible  for  the  5  x  5  se¬ 
quences  being  convolved  h  is  not  invertible  for  their  9x9  convolu¬ 
tion.  The  ttansfotmation 

T  -  |l  9| 

is  invertible  for  all  three  sequences.  With  this  transformation,  the 
5x5  sequences  are  considered  as  9  x  9  sequences  by  appending 
samples  of  value  zero  before  the  DRTs. 

4.  DIMENSION  A  I.IT5-CHANf;l\G 
TRANSFORMATIONS  FOR  NONRECTANC.L’LARLY 
SAMPLED  SIGNALS 

At  this  point  we  w  ould  like  to  combine  the  results  of  Secs.  I  and  2 
to  discuss  dimensionality-changing  transformations  for  signals  that 
are  tepresented  on  nonrectangular  sampling  lattices,  because  of  the 
matrix  notation  already  established,  this  is  straightforward  to  do. 

As  before,  let  f R(  n )  be  an  R-dimensional  sequence  that  is  pro¬ 
jected  onto  an  S-dimensional  sampling  lattice  with  R<S  and 
transformation  matrix  T.  Thus 

gs(Jn )  fR(n).  (28) 

Now.  however,  we  will  assume  that  the  sequences  fR(  n  )and  gs(  m  ) 
correspond  to  nonrectangular  samples  of  the  continuous  R  and 
S-dimensional  signals  f,  R|  t")  and  gts(r).  It  we  denote  the  two 
sampling  matrices  by  >i  and  V,  then 

(,(«)=  fu(X»)  (29) 

gslm)  x  gts(Vm).  (30) 


The  matrix  is  R  x  R  and  V.  is  S  *  S.  As  an  example.  fR(  n  )  could 
represent  a  3-D  signaljhat  has  been  sampled  on  a  body-centered 
cubic  lattice,  and  gs(  Til )  could  be  samples  taken  on  a  2-D  hex¬ 
agonal  lattice.  _  _ 

The  Fourier  transforms  of  the  sequences  fR(  n )  and  gs(  m )  are 
given  by 


F„(u)  -  £  fR(  n)exp|-j2eu  ?sn|  (31) 

n 

Cisl  s’)  =  J  fs(m»*'PH2*v  Vm|.  (32) 

m 

By  substituting  Eq.  (28)  into  Eq.  (32),  we  see  that 

G,,(7).  £  fR(n) exp|-j2xTXInl  (33) 

n 

“  »>((II£ ')■“)■  ,J4) 


As  we  saw  in  the  rectangular  case,  again  we  observe  that  the  linear 
transformation  between  n  and  m  induces  a  linear  transformation 
between  u  and  s'. 

As  a  special  case,  we  can  consider  the  rowwise  lexicographic 
ordering  of  the  N,  x  N.  sample  hexagonal  lattice  shown  i g  Fig.  5  in¬ 
to  a  l-D  sequence.  For  this  example  R  •  2.  S  •  I, 
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Fig.  6.  Th«  lines  In  the  2-D  Fourier  plane  (he!  contain  I  he  i-D 
Fourier  Irentlorm  of  I  he  leiicographtcallg  ordered  hexegonally 
sampled  2-0  signal  of  Eg.  (27). 
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Mere,  as  in  (he  rectangular  case,  the  l-D  Fourier  transform  of  (he 
lexicographically  ordered  sequence  corresponds  lo  the  I-D  Fourier 
transform  evaluated  along  a  straight  line.  Due  to  the  periodicity  of 
F„(  u ),  however.  this  is  eqursalent  lo  eval'iating  one  period  of 
F.(  u )  on  a  scries  of  interlaced  parallel  straight  lines  as  shown  in 
Fig.  6. 

lf,T  is  an  inscriible  operation,  then  R-dimcnsinnal  linear,  shift- 
invariant  operations  can  be  performed  on  fK(  n)  by  implementing 
an  S-dimensional  linear,  shifl-insanani  system  to  operate  on 
ls|  h  I-  In  this  respect  tFcic  is  no  difference  between  tlic  rect¬ 
angular  and  nonreciangular  cases. 

The  reason  for  using  a  dimensionality-changing  transformation 
for  signal  processing  is  primarily  for  implementation  convenience. 
Because  these  transformations  do  not  alter  the  total  number  of  data 
samples  involved,  merely  the  format,  they  do  not  result  in  com¬ 
putational  savings.  However,  if  hardware  or  software  or  optics  is 
available  for  processing  2-D  signals,  dimensionality -changing 


transformation  allow  such  a  system  lo  be  used  for  processing  |-D 
or  3-D  signals.  On  the  other  hand,  nonreciangular  sam  sling  lattices 
can  be  more  efficient  than  rectangular  ones  for  representing  certain 
bandlimited  signals.  Since  they  can  result  in  a  lower  sampling  den- 
sity,  they  can  require  less  storage  and  fewer  arithmetic  processing 
computations.  As  we  have  seen  in  this  paper,  it  is  not  necessary  lo 
sacrifice  the  convenience  of  a  dimensionality-changing  transforma¬ 
tion  10  Use  these  nonreciangular  sampled  representations. 
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This  thesis  addresses  the  problem  of  designing  efficient 
algorithms  for  the  evaluation  of  general  periodically  sampled  multi¬ 
dimensional  discrete  Fourier  transforms. 

The  contributions  of  this  thesis  may  be  roughly  divided  into 
three  categories.  First,  a  new  mathematical  formulation  for  the 
multidimensional  discrete  Fourier  transform  is  introduced.  With  this 
approach  the  DFT  indices  are  viewed  as  elements  of  a  lattice 
structure,  and  thus  geometric  techniques  can  be  used  to  manipulate 
them.  Second,  it  is  recognized  that  a  crucial  step  in  the  design  of 
new  algorithms  is  the  formulation  of  a  Chinese  remainder  theorem  for 
integer  vectors.  Next,  this  theorem  is  applied  to  provide  an  index 
map  which  is  similar  to  Good's  prime  factor  map.  Third,  a  general 
class  of  fast  algorithms  is  derived. 

The  Smith  normal  decomposition  is  used  to  factor  the  periodi¬ 
city  matrix  which  plays  an  important  role  in  the  design  of 
algorithms.  This  decomposition  leads  to  a  complete  characterization 
of  the  matrix  DFTs.  It  is  found  that  the  DFTs  cannot  be  uniquely 
defined  solely  by  their  lengths,  as  is  the  case  for  one-dimensional 
DFTs.  K  second  attribute,  the  form,  is  needed.  This  form  is  related 
to  the  non-equivalent  Smith  normal  forms. 

The  first  algorithm  presented  uses  the  Smith  normal  fora  to 
decoapose  a  general  DFT  into  a  rectangular  one.  This  decomposition 
is  shown  to  include  no  multiplications  and  consists  only  of  input  and 
output  permutations  of  the  data.  This  algorithm  is  essentially  use¬ 
ful  for  short  DFTs.  The  second  algorithm  presented  is  the  Matrix 


Prim*  rector  Algor  it  h»  which  compute*  long  transform  by  nesting 
short  DPTs  together.  It  is  shown  that  every  kind  of  nesting  can  be 
used.  A  derivation  is  given  which  solves  the  important  indexing 
problem  for  any  particular  form.  As  an  example,  a  general  length 
hexagonal  PPT  is  designed.  Finally,  an  answer  is  given  to  the  pro¬ 
blem  of  finding  an  optimal  periodicity  matrix  for  a  given  finite 
rectangular  area  sequence. 


nmooocriOM 


The  development  of  fast  algorithms  for  the  computation  of  the 
Discrete  Fourier  Transform  (DPT)  had  a  major  impact  on  the  field  of 
digital  signal  processing.  The  Past  Fourier  Transform  (PIT)  has 
become  an  excellent  tool  for  performing  spectral  analysis  and  linear 
filtering  (!)  -  (4J.  The  motivation  for  developing  FFT  algorithm  is 
rooted  in  the  fact  that  the  direct  computation  of  DFTs  requires  a 
number  of  operations  which  becomes  rapidly  excessive  for  long 
transforms.  The  savings  in  computations  made  by  the  FFT  is  a  direct 
result  of  advantageous  tradeoffs  of  control  complexity  against 
arithmetic  operations  count.  Great  efforts  have  gone  into  finding 
efficient  FFTs  for  one-dimensional  signals.  Because  the 
multidimensional  DPT  has  a  more  complex  structure  than  its  one¬ 
dimensional  counterpart,  it  has  only  recently  become  the  subject  of 
active  research. 

Multidimensional  signal  processing  has  become  a  major  area  of 
study  |5).  It  has  applications  in  important  fields  ranging  from 
image  processing,  to  antenna  design,  to  geophysics  and  optics.  The 
motivation  for  developing  multidimensional  FFTs  is  higher  because 
multi-dimensional  signals  are  typically  characterised  by  massive 
amounts  of  data  and  a  large  quantity  of  numerical  computations. 
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The  first  important  FIT  was  developed  by  Cooley  and  Tukey  In 
19(5  ((].  They  observed  that  the  common  strategy  of  'divide  and 
Conquer*  could  be  used,  in  a  power  of  two  length  OFT.  They  evaluated 
a  large  DPT  by  dividing  it  into  successively  saaller  DFTs,  until 
eventually  only  length  two  DFTs  are  left,  and  the  entire  computation 
is  dominated  by  the  so-called  ‘twiddle*  operations  between  the 
stages. 

The  Common  Factor  Algorithm  proposed  by  R.  C.  Singleton  in 
1969  (7)  is  an  extension  of  the  same  idea  to  the  case  where  the 

length  of  the  OFT  is  arbitrary.  It  builds  up  a  large  DFT  out  of  many 
saaller  OFT  algorithms,  connected  by  twiddle  factors.  Great  efforts 
have  jone  into  reducing  the  number  of  twiddle  operations  in  the 

Cooley-Tukey  algorithm. 

Another  way  of  connecting  small  DFTs  is  to  use  the  prime 

factor  index  map  proposed  by  I.  J.  Good  in  1958  (8).  This  method 
eliminates  the  twiddle  factors  entirely  but  requires  the  constituent 
small  DFT  algorithms  to  have  prime  lengths.  Unfortunately,  there 
were  no  efficient  prime  length  DFTs  at  the  time  the  algorithm  was 

first  proposed.  Good's  index  map  was  revised  in  1977  when  Kolba  and 
Parks  proposed  their  Prime  Factor  Algorithm  (PFA)  (9).  The  key  step 
leading  to  the  PFA  was  the  introduction  in  1976  by  Winograd  { 70 )  of  a 
number  of  very  efficient  small  prime  and  power  of  a  prime  length  DFT 
algorithms.  Winograd  combined  the  idea  of  converting  a  DFT  to  a 
circular  convolution,  originally  described  by  Rader  (11]  in  1968, 
with  minimum  multiplication  convolution  algorithms  he  developed  in 
1975  (12). 


The  moat  important  factor  in  the  development  of  new  algorithms 
was  the  recognition  by  Winograd  that  DFTs  can  be  viewed  as  operations 
defined  in  finite  rings.  More  specifically,  the  indices  in  the  OFT 
are  considered  as  elements  of  a  ring  and  it  is  the  structure  of  this 
ring  which  is  exploited  to  develop  new  algorithms.  This  new  number 
theoretic  point  of  view  has  allowed  both  derivation  of  some  lower 
computational  complexity  bounds  and  design  of  new  and  improved  compu¬ 
tation  techniques. 

The  first  method  used  to  evaluate  multidimensional  DFTs  was 
the  row-column  method  which  divides  the  computation  into  the  compu¬ 
tation  of  a  number  of  one-dimensional  DFTs  (13).  A  more  efficient 
approach  is  the  vector-radix  algorithm  which  is  an  extension  of  the 
Cooley-Tukey  algorithm  to  the  multidimensional  case  [141  ~  [18). 
Nussbaumer  (26),  (271  used  polynomial  theory  to  extend  Hinograd's 
techniques  to  the  multidimensional  case.  All  these  multidimensional 
algorithms  apply  to  the  so  called  "rectangular*  DFTs  which  involve 
rectangularly  sampled  signals.  R.  M.  Mersereau  [19],  (20)  exhibited 
signals  sampled  on  non-rectangular  rasters.  Mersereau  and  Speake 
extended  in  1981  (21)  the  Cooley-Tukey  algorithm  to  the  general 
periodically  sampled  multidimensional  case.  They  have  shown  that 
both  the  row-column  and  the  vector-radix  algorithm  can  be  derived  as 
special  cases  of  their  algorithm. 

Scope  of  Thesis 

Due  to  its  theoretical  as  well  as  practical  importance,  this 
thesis  considers  the  problem  of  developing  efficient  algorithms  for 
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the  evaluation  of  a  general  periodically  sampled  multidimensional 
OFT.  The  contribution  of  this  thesis  may  be  roughly  divided  into 
three  categories.  First,  we  formulate  a  new  mathematical  format  for 
the  multidimensional  OFT.  More  specifically,  we  view  the  OFT  indices 
as  vectors  in  a  lattice  structure.  With  this  mathematical 
foundation,  it  is  possible  to  use  geometric  techniques  for  manipu¬ 
lating  these  indices.  Second,  from  the  Insight  gained  from  this 
approach  we  are  able  to  recognize  that  a  crucial  step  in  the  design 
of  new  algorithms  is  the  formulation  of  a  theorem  which  is  equivalent 
to  the  Chinese  Remainder  theorem  for  integers  [22].  This  theorem  is 
used  to  provide  an  index  map  which  is  similar  to  Good's  prime  factor 
map  [8].  Third,  we  investigate  the  applications  of  this  novel 
approach  to  the  design  of  new  and  improved  algorithms.  In 
particular,  we  provide  an  extension  of  the  PFA  and  the  winograd 
Fourier  Transform  Algorithm  [3]  to  the  general  multidimensional 
case.  When  implementing  the  multidimensional  FFA,  it  is  found  that 
the  multidimensional  indexing  problem  is  quite  complex.  We  provide  a 
method  which  solves  the  problem  in  a  satisfactory  manner. 


Outline  of  Thesis 

The  thesis  is  divided  into  several  parts.  In  Chapter  II  we 
begin  by  Introducing  the  definition  of  a  general  periodically  sampled 
multidimensional  DFT.  Then,  the  matrix  Cooley-Tukey  algorithm  is 
explained,  in  order  to  have  a  better  grasp  of  the  difficulties 
Involved  in  developing  multidimensional  algorithms.  Next,  we  give  a 
presentation  of  some  results  from  lattice  theory  that  are  going  to  be 
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used  In  the  remainder  of  the  thesis.  .  He  will  also  refornulate  the 
multidimensional  DPT  in  this  new  mathematical  context.  It  is  known 
that  the  factorization  of  the  periodicity  matrix  plays  an  important 
role  in  the  design  of  algorithms.  To  help  in  this  factorization,  the 
Smith  Noraal  decomposition  is  used.  Moreover,  using  the  Smith  Normal 
decomposition,  we  will  be  provided  with  a  means  for  classifying 
multidimensional  DFTs.  We  find  that  the  DFTs  cannot  be  uniquely 
defined  by  their  lengths,  as  is  the  case  for  one-dinensional  DFTs.  A 
second  attribute,  the  form,  is  needed.  This  form  is  closely  related 
to  non-equivalent  Smith  Normal  forms. 

In  Chapter  III,  we  start  by  presenting  an  algorithm  that  uses 
the  Smith  Normal  decomposition.  The  algorithm  transforms  the  DFT 
into  a  rectangular  DFT  which  can  be  evaluated  by  more  conventional 
methods.  Then,  a  Chinese  Remainder  theorem  for  integer  vectors  is 
proved.  The  theorem  is  used  to  provide  an  index  map  for  a  new  class 
of  DFT  algorithms.  From  this  general  class,  we  discuss  in  detail  the 
Multidimensional  Prime  Factor  Algorithm  (MPFA) .  The  MPFA  is  just  one 
particular  case  of  nesting  the  short  DFTs  (or  modules)  together.  A 
Winograd  type  of  nesting  results  in  multidimensional  HFTA. 

The  indices  in  the  MDFT  are  vector  elements  and  as  such  are 
difficult  to  handle,  both  from  an  arithmetic  point  of  view  and  an 
algorithmic  point  of  view.  As  an  interesting  byproduct  of  lattice 
theory,  it  is  »!<own  how  to  represent  these  indices  in  a  manner  which 
is  similar  to  the  representation  of  indices  of  rectangular  DFTs.  for 
this  reason,  we  call  thi3  process  the  rectangularizatirn  of  indices. 
It  has  practical  applications  in  algorithmic  design. 


In  Chapter  IV  we  consider  the  practical  side  of  the  thesis, 
we  construct  and  test  a  set  of  multidimensional  FFT  algor  it  has.  The 
highly  complex  problem  of  multidimensional  indexing  is  addressed  and 
a  solution  to  it  is  offered.  He  describe  both  general  length  and 
general  form  algorithms  such  as  the  U.  Jl  V.  algorithm  and  the  HPFA 
algorithm  and  a  general  length,  but  a  specific  form,  hexagonal  PPA 
algorithm.  The  methodology  offered  will  allow  the  design  of  any 
other  specific  form  general  length  algor. '.hm.  We  also  consider  the 
problem  of  finding  an  optimal  periodicity  matrix  when  given  a  finite 
area  sequence  and  a  particular  DFT  algorithm.  Finally,  Chapter  V 
concludes  the  thesis  with  a  summary  of  the  results  presented  in  the 
previous  chapters  and  some  recommendations  for  future  research  are 
described. 

It  is  important  to  note  here  that  all  the  results  that  are 
presented  in  this  thesis  apply  to  systems  with  more  than  two 
dimensions.  But  for  reasons  of  clarity  and  ease  of  presentation, 
sometimes  only  the  two-dimensional  case  is  treated.  However,  in 
these  cases  the  extension  from  two  to  higher  dimensions  can  be 
realized  in  a  straightforward  manner. 
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CHAPTER  II 

BACXGROOHD  MATERIAL 

The  tools  needed  for  the  analysis  of  the  multidimensional  DPT 
are  presented  in  this  chapter.  After  some  preliminary  material 
covering  important  characteristics  of  the  DPT  itself,  a  derivation  of 
the  multidimensional  Cooley-Tukey  algorithm  is  given  which  closely 
follows  that  presented  in  (21].  The  discussion  of  the  algorithm 
serves  two  purposes.  First,  it  provides  an  example  of  the  concepts 
involved  and  of  the  difficulties  encountered  when  generalizing 
one-dimensional  results  to  the  multidimensional  case.  Secondly,  the 
limitations  of  the  algorithm  motivate  this  thesis.  In  the  third 
section,  some  theorems  and  lemmas  from  a  branch  of  mathematics,  known 
as  lattice  theory  ->re  introduced.  These  will  form  the  basis  for 
deriving  a  multidimensional  extension  to  the  classical  Chinese 
Remainder  Theorem  of  number  theory.  The  extension,  in  its  turn, 
forms  a  pillar  on  which  a  derivation  of  a  new  large  class  of  DFT 
algorithms  will  be  based. 

A  simple,  special  case  of  the  multidimensional  DPT  is  the 
rectangular  DPT.  Many  relatively  efficient  algorithms  such  as  the 
row-column  algorithm,  the  vector  radix  algorithm  and  many  generali¬ 
sations  [4],  have  been  developed  for  the  computation  of  rectangular 
DFTs.  Therefore,  a  procedure  which  reduces  a  general  DFT  into  a 
rectangular  one  is  highly  desirable.  However,  the  reduction,  to  be 
useful,  shouldn't  demand  great  cost  nor  great  effort.  Such  a  proce¬ 
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dure  will  be  developed  in  the  next  chapter.  It  requires  the 
decomposition  of  the  periodicity  matrix  into  a  special  form  called 
the  Smith  normal  form.  In  section  four  a  theorem  is  proved  that 
addresses  that  concern.  In  addition,  it  provides  us  with  an  easily 
proqrammable  procedure  for  findinq  integer  factors  of  integer 
matrices.  In  the  one-dimensional  case  whe'  N  (the  length  of  the  DPT) 
is  factored  into  relatively  prime  factors,  factorization  of  the 
periodicity  matrix  constitutes  the  starting  block  for  algorithms.  A 
number  of  examples  are  presented  to  illustrate  the  different  con¬ 
cepts. 


Matrix  DPT 

The  Discrete  Fourier  Transform  (DPT)  is  often  described  as  an 
invertible  linear  transformation  which  operates  on  complex  valued 
vectors.  For  each  integer  N>1,  the  length  M  DPT  relates  an  N- 
dimens ional  complex  vector  into  another  complex  vector.  It  is 
defined  as: 


N-1 

ylk)  -  l  x(n)  expl-j  '—5—  )  (t) 

n-0  N 

Equation  (1)  represents  a  one-dimensional  DPT  because  the  signals 

x(n)  and  y(k)  are  functions  of  one-dimensional  variables  n  and  k 

respectively.  As  expressed  in  (1),  explicit  computation  of  the  DPT 

2 

requires  on  the  order  of  N  operations  which  becomes  rapidly  exces¬ 
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sive  for  large  N.  The  simplicity  of  equation  (?)  disguises  the  fact 
that  there  are  many  redundant  operations  in  such  a  computation. 
These  redundant  operations  arise  from  the  fact  that  the  weighting 


function  (the  complex  exponential)  lx  periodic  in  both  n  and  Ic  with 
period  N.  furthermore,  the  equation  nay  also  exhibit  complex  conju¬ 


gate  symmetries.  Consequently  with  careful  tabulation  of 
internediate  results,  it  is  possible  to  substantially  reduce  the 
number  of  multiplications  required  to  calculate  the  ti  values  of  the 
sequence  y(k).  This  is  the  essence  of  all  Past  Fourier  Transform 
algorithms  (FFT) . 

The  multidimensional  DPT  (natrix-DFT)  may  occur  in  two 
different  contexts.  First,  it  applies  to  signals  which  are  function 
of  more  than  one  variable.  For  example,  a  sampled  picture  is  a  two 
dimensional  signal,  since  the  light  intensity  varies  with  the  two 
spatial  coordinates  of  the  image.  Other  examples  exist  in  areas  such 
as  optics,  x-ray  crystallography  and  antenna  design.  Second,  there 
is  often  a  direct  relationship  between  one-dimensional  DFTs  and 
matrix-DPTs.  Indeed,  a  matrix-DFT  can  be  derived  from  a 
one-dimensional  DPT  and  vice-versa.  Two  commonly  used  PFT 
algorithms,  the  Prime  Factor  algorithm  (PFA)  and  the  Winograd 
algorithm,  are  based  on  a  special  mapping  of  one-dimensional  signals 
into  multidimensional  ones. 

As  in  the  one-dimensional  case  it  is  the  inherent  periodicity 
of  the  weights  in  the  matrix-DFT  which  is  to  be  exploited  in  order  to 
produce  efficient  algorithms.  But  since  each  dimension,  in  general, 
has  an  effect  on  the  other  dimensions,  this  exploitation  is  not 
straightforward.  The  multidimensional  nature  of  the  signals  require 
vector  and  matrix  arithmetic  for  compact  notation  and  for  the  ability 
to  readily  apply  previously  derived  results  for  the  one-dimensional 
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Drt.  It  1*  known  that  there  are  many  different  ways  for 
representing,  or  sampling,  multidimensional  bandlimited  signals 
(5).  Mersereau  has  developed  a  matrix  description  of  multidimen¬ 
sional  sampling  and  digital  signal  processing  |20].  A  summary  of 
these  portions  of  this  background  which  are  needed  to  understand  the 
thesis,  is  given  below. 

Let  *A(tJ  denote  an  M-dimensional  analog  siqnal  where  _t_  is  a 
vector  of  M  independent  variables.  A  sampled  representation  of  this 
signal  is  given  by: 


x (n)  -  x#{V  n)  (2) 

where  n_  is  an  integer  vector  of  dimension  M  and  V_  is  an  M*M  matrix  of 
real  numbers  that  defines  the  locations  of  the  sampled  values  (Figure 
1).  In  order  to  have  a  nondegenerate  set  of  samples,  the  columns  of 
V  must  be  linearly  independent.  Different  matrices  V.  “ill  lead  to 
different  sampling  strategies.  By  careful  choice  of  V  a  represen¬ 
tation  can  be  obtained  which  results  in  a  minimum  sampling  density. 
The  most  common  sampling  scheme  is  the  rectangular  one  which 
corresponds  to  the  case  where  V  1b  a  diagonal  matrix.  In  the  2-D 
rectangular  case: 


<n, 


n2> 
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x(n)  ■  *,(«», Tt»  n2T2* 


where  *  denotes  the  transposition  operation  and  T,  and  T?  are  the  row 
and  column  sampling  periods. 

Another  scheme  of  practical  interest  is  the  hexaqonal  scheme 
discussed  by  Mersereau  (19).  For  this  scheme,  the  sampling  matrix  V_ 
has  the  form 


v 


It  has  been  shown  that  hexagonal  sampling  is  optimal  for  cir¬ 
cularly  band limited  waveforms  (23).  It  requires  13.4  percent  fewer 
samples  than  rectangular  sampling. 

The  Oiscrete  Fourier  Transform  is  commonly  introduced  by  means 
of  periodic  sequences.  Suppose  x(n)  is  a  periodic  sequence  of 
samples  with  periodicity  N,  i.e. 

x(n)  -  x(n  ♦  H  jr »  (3) 
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where  x_  is  any  integer  vector  and  in  an  integer  Matrix  whose  deter¬ 
minant  is  nonzero.  It  follows  that  xin)  is  periodic  in  the  M 
different  directions  which  are  defined  by  *J>e  vectors  formed  from  the 
columns  of  N_  (Figure  2).  Let  denote  a  region  in  the  _n-plane  con¬ 
taining  one  period  of  x(n).  I„  is  not  unique  but  always  contains 
|det(N)|  samples  of  x(n)  (where  f  *  I  denotes  the  absolute  value). 
x(jj)  can  then  be  exactly  represented  by  a  set  of  Fourier  series 
coefficients  which  will  be  denoted  by  X(jc),  where 


x(n)  -  (  1  X(M  expl*j2*k  N-’nl 


X Cje_>  -  \  x(n)  exp[-j2*k^M  'nl 


neIn 


The  sequence  of  coefficients  x(M  is  periodic  with  periodicity 
>J_'  and  JN  denotes  a  set  of  samples  in  one  period  of  X (_k> . 

Now,  let  x(ji)  be  a  sequence  with  finite  support  on  IN>  i.e. 
x (n)  is  zero  for  values  of  n_  not  in  the  reqion  IN.  Next,  construct  a 

sequence  x(n)  of  period  N_  which  is  equal  to  x(n)  in  the  region  IN. 

Thus,  x (_n>  can  be  completely  specified  by  x(n)  and  vice-versa.  If 
X(k_)  is  defined  as  one  period  of  X(JO»  the  discrete  Fourier  series 

coefficients  of  x(n)»  then  x(n)  can  be  related  to  X(jt)  and 

vice-versa.  This  relation  defines  the  Discrete  Fourier  Transform 
(OFT) < 

! 
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*<n>  -  | (nj  |  1  **!>  explej2*kV\il 


-  i«„ 


X(k)  -  l  x(n)  •*pl-j2n*M“,n) 


The  numbers  X(k_)  can  be  interpreted  as  samples  of  the  Fourier 
transform  of  the  sequence  x(ji).  The  eanplinq  matrix  in  the  Fourier 
domain  Is  then  [SI  s 
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The  matrrx-DFT  can  be  used  to  compute  a  circular  convolution 
of  multidimensional  signals.  This  is  the  connection  between  the  DFT 
and  linear  filtering  that  is  exploited  in  order  to  create  linear 
filtering  algorithms  with  the  DFT.  Other  uses  for  the  DFT,  along 
with  many  of  its  interesting  properties,  are  discussed  at  length  in 
general  references  (1]  -  (3). 

This  thesis  is  concerned  with  methods  for  evaluating  (5)  for 
an  arbitrary  periodicity  matrix  21*  Most  of  the  methods  that  exist 
treat  only  the  case  where  _N_  is  a  diagonal  matrix.  In  that  case,  the 
matrix-DFT  is  said  to  be  rectangular  because  it  relates  a  rectangu¬ 
larly  sampled  signal  to  its  rectangularly  sampled  Fourier 
transform.  In  this  case 
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and  equation  (S)  becomes: 


n,k, 


X(k1,k2,...,kM)  -  l  l  ...l  x(n1,n2>...,nM)  exp(-j2»  — —  )■ 

"l  n2  nM  1 


Vk. 


n?k2 

exp[-j2»  -jj— !•••  expl-j2*  -jpM 


The  kernel  (exponential)  has  been  factored  into  independent 
kernels.  Evaluation  along  any  dimension  can  therefore  be  done  inde¬ 
pendently  of  the  other  dimensions.  The  row-column  algorithm  is  an 
example  of  such  an  evaluation.  With  this  algorithm,  one-dimensional 
DFTe  are  computed  sequentially  with  respect  to  each  variable  nA  for 
each  value  of  the  remaining  variables.  Another  approach  is  the 
vector  radix  EFT  algorithm  in  wnich  the  OFT  is  broken  down  into  suc¬ 
cessively  smaller  DFTs  until  only  trivial  DFTs  need  to  be  evaluated. 

A  number  of  efflciint  algorithms  have  been  developed  more 
recently  (bj  -  (7J.  Among  them  are  the  Prime  Factor  Aloorithm  (PFA) 
and  the  Winograd  Fourier  Transform  Algorithm  (WFTA).  Both  are  based 
on  a  reduction  of  long  length  DFTs  into  a  number  of  smaller  ones. 
Moreover,  the  small  DFTs  (called  modules)  which  are  to  be  combined 
are  constructed  to  be  highly  efficient.  Their  efficiency  is  the 
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reful*  of  two  developments.  One  le  the  conversion  of  the  DFTa  into 
circular  convolutions  by  a  method  initially  introduced  by  Rader 
(11).  The  conversion  is  done  by  a  simple  rearrangement  of  input  and 
output  samples.  The  other  development  is  the  presentation  by 
Winograd  of  minimal  algorithms  for  the  computation  of  convolutions. 
The  PFA  and  the  WFTA  differ  in  their  internal  orderings  of 
operations,  which  affect  their  overall  operation  counts  and  program 
control  complexity.  A  generalization  of  these  two  algorithms  will  be 
given  in  Chapter  III. 

An  important  example  of  a  non-rectanqular  matrix-DPT  is  the 
two-dimensional  hexagonal  OFT  which  has  been  studied  in  (19);  the 
periodicity  matrix  has  the  form: 

N2' 

N  • 

.N2  *1. 

where  Nj  and  H2  are  integers.  A  row-column  type  and  vector  radix 
type  algorithm  for  the  hexagonal  DFT  are  given  in  (19).  It  is  shown 
that,  for  the  same  frequency  resolution,  the  hexagonal  DFT  requires 
25t  less  storage  capacity  and  251  less  computation  than  its  rectangu¬ 
lar  counterpart. 


Matrix  Cooley-Tukey  Algorithm 

One  of  the  earliest  PFT  algorithms  was  the  popular  Radix-2 
algorithm  known  as  the  Cooley-Tukey  FFT  (61 .  Mersereau  and  Speake 
have  shown  that  algorithms  of  the  Cooley-Tukey  type  exist  for  the 


Ms  denote  _■  »  ( (n)  )H  if  j*  is  congruent  to  _n  (modulo  N)  and  is  con¬ 
tained  in  IN> 

Any  vector  £  in  the  region  can  be  uniquely  expressed  as  _n 
»  ( (Pq  *  £)  )N  where  £  belongs  to  a  set  Ip  and  £  belongs  to  a  set 
IQ.  Ip  contains  Idet  (P)  I  vectors,  IQ  contains  Idet  Qj  vectors. 

By  expanding  the  indices  and  the  exponential  in  equation  (5) , 
the  DFT  can  !x>  decomposed  into  two  parts: 


C(£.ll 


al  x(  (Pq+p)  )Nexp[-j2»(l_'+m’0>Q  'a) 

*£Iq 


(7) 


X(Q*m+l)  -  l  C(£,l|expC-j2*l'M",£jexpI-j2«m*p"1£l 

EeIP 


This  algorithm  decomposes  the  DFT  into  a  series  of  Idet  (£)  I 
matrix-Q  DFTs,  a  series  of  Idjt  (Q_)l  matnx-P_  DFTs  and  Idet  ( N)  I 
twiddle  factor  multiplications  (multiplications  by  expl-j_l_’  (2»(l  'jj)  J . 

These  relations  represent  the  first  level  of  decomposition  of 
a  decimation-in-time  Cooley-Tukey  PFT  algorithm.  It  is  clear  that  a 
different  factorisation  of  N_  leads  to  a  different  decomposition  and 
therefore  to  another  algorithm.  Thus  the  matrix  Cooley-Tukey 


algorithm  is  in  fact  a  collection  of  algorithms  which  differ  only  by 


the  way  the  periodicity  matrix  ia  factored.  How  to  factor  N  to 
obtain  a  suitable  algorithm  is  an  open  problem.  A  tentative  solution 
to  that  problem  will  be  given  in  the  next  chapter. 


The  number  of  complex  multiplications  is  often  given  as  a 
measure  of  comparison  between  algorithms.  Let  CN  denote  the  number 
of  complex  multiplications  for  the  matrix  Cooley-Tukey  FFT  algorithm, 
then 

CM  -  Idet  PI  C  ♦  Idet  01  C„  ♦  Idet  N| 

H  —  Q  —  P  —  (8) 

where  Cg  and  Cp  represent  the  computational  complexity  of  the 
matrix-4  DFT  and  the  matrix-P_  DPT,  respectively.  The  final  term  in 
equation  (8)  corresponds  to  the  number  of  multiplications  by  the 
twiddle  factors. 

Both  the  row-column  decomposition  and  the  vector  radix 
algorithm  for  rectangular  DFTs  can  be  shown  to  be  special  cases  of 
the  matrix  Cooley-Tukey  algorithm.  For  example  if 


then  a  row-column  algorithm  corresponds  to  the  factorization 
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while  it  Mj-Nj-N  and  N  is  a  power  ot  two .  vector-radix  (2*2)  FFT 
algorithm  la  a  result  of  the  factorization 

-ni: ;]-[  °j 

It  is  known  that  the  computational  complexity  of  a  one-dimen¬ 
sional  OFT  is  ultimately  linked  to  its  length.  In  the 
multidimensional  case,  it  is  intuitively  clear  that  the  computational 
complexity  depends  not  only  on  the  length  of  the  matrtx-DFT  but  also 
on  the  form  of  the  periodicity  matrix  N.  For  example,  it  is  known 
that  a  DPT  with  periodicity  matrix 


N 


0 

2 


] 


can  be  evaluated  with  less  computations  than  a  DFT  with  periodicity 


although  both  have  length  four.  An  open  problem  is  then  how  to 
classify  matrix-DFTs  both  according  to  length  and  form.  An  answer  to 
that  problem  will  be  given  in  section  four  where  it  is  shown  that  the 


>3? 


JfU- 


multiplicative  complexity  of  a  matrix  -  N_  DPT  can  be  deduced  from  the 
Smith  Normal  decomposition  of  N. 


The  matrix  Cooley-Tukey  algorithm  uses  one  particular  Indexing 
scheme  whereby  the  indices  n_  and  k_  ace  mapped  to  a  pair  of  indices 
011 '£2*  and  (Jli  > Jlj >  •  i»  Interesting  to  ask  whether  a  different 
scheme  sight  not  lead  to  a  different  class  of  algorithms.  It  is  to 
answer  this  question  that  we  have  been  let  to  consider  lattice 
theory. 

A  lattice  structure,  as  it  will  be  seen  in  the  next  section, 
integrates  both  the  vectorial  nature  of  the  indices  and  the  inherent 
periodic  nature  of  the  matrix-DFT. 

Results  from  Lattice  Theory 

The  new  algorithms  and  results  that  will  be  proposed  are  de¬ 
rived  from  geometric  number  theory,  and  some  knowledge  of  this  topic 
is  necessary  to  understand  these  algorithms  and  to  use  them  in 
practical  applications.  For  this  purpose,  this  section  is  lntendeu 
to  familiarize  the  reader  with  geometric  number  theory.  From  the 
insight  gained  we  will  be  able  to  formulate  and  prove  an  important 
theorem  which  resembles  the  Chinese  Remainder  theorem  for  integers. 
The  new  theorem  will  play  a  central  role  in  the  process  of  generaliz¬ 
ing  both  the  Prime  Factor  Algorithm  and  the  Winograd  Fourier 
Transform  Algorithm  to  the  multidimensional  case.  Additionally,  we 
will  be  able  to  find  a  procedure  by  which  a  DFT  with  an  arbitrary 
periodicity  matrix  is  transformed  into  a  form  where  the  indices  are 
rectangularly  distributed.  Using  this  transformation  the  DFT  can  be 
converted  into  either  a  one-dimensional  DFT  or  into  a  rectangular 


\>V'. 


In  th*  following  we  define  a  Multidimensional  lattice  and 


present  some  useful  theorems. 

Definition!  Let  a,  ,a,( . . . , be  M  (M>2)  linearly  independent  vectors 
in  the  M-dinensional  real  Euclidian  space.  The  set  of  vectors 

*  “  ♦  “2«2  ♦  —  +  Vm  (9> 

with  integers  Uj,...,un  is  called  the  lattice  with  basis  a1>...,aH. 

Equations  (9)  can  be  written  in  compact  form  as 

x_  ■  A  ij 

where  A.  *  (£j  a^)  is  an  M>«  matrix  and  _u  *  (u,  u2...  i^)  •  is  a 

column  vector  of  integers.  The  lattice  is  called  the  lattice 
generated  by  A  and  is  denoted  LA.  An  example  of  a  lattice  is  the  set 
of  all  vectors  with  integral  coordinates.  It  is  generated  by  the 
vectors  *»«.«eM  where  e^  is  an  M-vector  with  1  at  position  i  and 

0  elsewhere,  i.e.  ei  •  (0.. .0)0. ..0) '.  The  lattice  is  denoted  Lj 
where  I_  is  then  the  identity  matrix. 

Delation  Between  Different  Bases  of  a  Lattice 

Since  we  consider  merely  as  a  set  of  points,  it  can  be 
expressed  in  terns  of  more  than  one  basis.  For  example  (a-c,  b-d)', 
(-c,~d) '  and  (a,b)',  (c,d)'  are  both  bases  for  the  lattice  of  Figure 


'•  t 
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Consider  •  lattice  generated  by  ,  a^,...,^  and  suppose  that 
the  vectors  ba#.*.#bH  are  also  a  basis  for  the  lattice.  Then 
the  (a^j  can  be  written  as  a  linear  combination  of  the  (b^): 


±i 


hi£l  *  hiA  * 


+  Ui£* 


(10) 


where  {h^}  are  all  integers.  However,  since  [a^ }  is  a  basis,  we 
must  have 


*i 


9i1±l 


*  9i2— 2 


9itA« 


(11) 


where  the  (g^}  are  all  integers.  Sustituting  for  (tr)  in  (10)  from 
(11)  we  find  (aAJ  as  a  linear  function  of  {a^}.  Since  the  {jr}  are 
independent,  the  matrix  of  the  resulting  transformation  must  be  the 
identity  matrix.  Therefore  the  product  of  the  matrix  (h^J  by  the 
matrix  Ig^l  must  be  the  identity  matrix,  so  that  Ihjjl  is  the  in¬ 
verse  of  Igjjl  and  detth^j]  is  the  reciprocal  of  detlg^^).  However, 
these  determinants  must  be  integers  since  their  elements  are  inte¬ 
gers.  It  follows  that  detlg^J  -  detlh^]  -  *1.  Thus  [h^]  and 
(9|jJ  *re  unimodular  matrices.  We  have  just  proved  that 
if  (a j }  and  {b^}  form  a  basis  for  the  same  lattice,  they  must  be 
related  by  a  unimodular  linear  transformation.  The  converse  is  also 
true.  If  1^}  is  a  basis  and  }  is  obtained  from  {a^}  by  a  uni¬ 
modular  transformation,  then  {b.}  is  a  basis.  This  follows  easily 
from  the  fact  that  the  inverse  of  a  unimodular  matrix  is  unimodular, 
so  that  (_a j J  can  be  expressed  es  a  linear  combination  of 
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the  {b^J  with  Integral  coefficient!  and  therefore  {b^j  fora  a  baaia. 
This  completes  the  proof  of  the  following  theorem: 

theorem  it  Let  {a^  ,0^,. . .  ja^}  be  a  baaia  for  a  lattice.  A  neceaaary 
and  aufficient  condition  that  another  set  of  independent- 
vectors  {bj,...,^}  fora  a  basis  of  the  lattice  is  that  {b^|  My  be 
obtained  from  (a,.}  by  a  unimodular  transformation. 


This  shows  that  there  is  a  one- to- many  relationship  between  lattices 
and  matrices.  To  each  nonsingular  matrix  A_  there  corresponds  a 
single  lattice  L  generated  by  its  columns,  but  for  each  lattice  L 
there  corresponds  a  whole  class  of  nonsingular  matrices.  Two 
matrices  A_  and  B_  belong  to  the  same  class  if  and  only  if  ^  ■  B  U 
where  £  is  a  unimodular  matrix. 

Sublattlcea 

Let  LA  be  a  lattice  and  Lg  be  another  lattice  contained  in  L^j 
i.e.  every  vector  in  Lg  is  also  a  vector  in  Lft.  Lg  is  then  called  a 
sublattice  of  L^.  Then  any  basis  of  Lg  can  be  written  as  a  linear 
combination  of  vectors  from  the  basis  of  L^t 


where  {g^}  are  all  integers.  Denote  the  absolute  value  of  det  tg^jl 
as  a.  Then  a>0  since  the  {b^}  are  linearly  Independent  and  a  must  be 
an  Integer  since  {g^}  are  integers.  If  m-1,  by  the  previous 
theorem  (b^ }  is  also  a  basis  of  LA  and  so  LA  and  Lg  are  the  same 
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lattices.  The  number  m  is  uniquely  determined  by  Ig  end  Lg  because 
choosing  any  other  basis  for  LA  or  Lfi  is  equivalent  to  Multiplying 
the  Matrix  either  on  the  right  or  on  the  left  by  a  unioodular 
Matrix  and  this  Multiplication  does  not  change  the  value  of  the 
deterMinant  of  the  matrix.  »  is  called  the  index  of  Lg  in  LA.  He 
have  just  proved  the  following  theorem: 

theorem  2t  Lg  is  a  sublattice  of  LA  if  and  only  it  ^  *  AC  for  some 
inceger  matrix  C. 

As  an  illustration,  let  -  Lj  be  the  lattice  of  integer 
vectors  with  basis  (1,0)*  and  (0,1)*  while  LB  is  the  lattice  with 
basis  vecors  (2,0)  *  and  (0,3)*.  It  is  easy  to  shown  that  lg  is  con¬ 
tained  in  Lg  and  the  index  of  t<B  in  Lj  is  6. 

In  fact  if  JM  is  any  integer  matrix  then  1^  is  a  sublattice  of 
tj  (the  lattice  of  integer  vectors)  since  the  following  equality  is 
always  true 


N  ■  I  N 


greatest  common  Sublattice 

An  integer  matrix  A_  is  said  to  be  a  left  divisor  of  an  integer 
matrix  if  there  is  another  integer  matrix  £  such  that 

B  ■  A  C 
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At  the  iui  time  B_  is  said  to  bs  a  right  multiple  of  A.  Similarly  C 
is  a  right  divisor  of  B_  and  £  it  a  left  multiple  of  C . 

For  two  integer  matrices  A  and  B_  there  always  exists  an  inte¬ 
ger  Matrix  £  called  the  least  common  right  multiple  (lcrm)  of  A  and 
£.  _D  is  defined  by  three  relationships. 


I)  D  u  a  right  multiple  of  A,  i.e.  D_  ■  A  0  for  some  inte¬ 
ger  matrix  0. 


2)  D  is  i  right  multiple  of  B,  i.e.  £  ■  B  V  for  some  inte¬ 
ger  matrix  V. 


3)  Whenever  £  is  some  right  multiple  of  A  and  B  then  C  is 
also  a  right  multiple  of  D,  i.e.  G_  ■  £w_  for  some  inte¬ 
ger  matrix  w. 


This  last  property  explains  the  notion  of  a  least  common  right 
multiple. 

The  construction  of  the  lcrm  is  done  as  follows:  first  con¬ 
sider  the  2M*2M  matrix 


£  0  J 


then  find  a  unimodular  matrix  X  of  order  2M( 


p-^'7 
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in 

—12 

.—21 

—22. 

such  that 


A  B 

0  0 


X..  X.  " 

’  H  0 

—11  —12 

■ 

.—21  *22. 

£  0 

It  can  be  shown  (24]  that  the  matrix  x_  can  always  be  constructed 
using  elementary  row  and  column  operations.  Then,  we  have  the  rela¬ 
tions 


A  X 
0  - 


12  *l& 


—  — 12 


22 


B  X 


0 

22 


where  D.  is  the  lerm  of  A_  and  B_. 
As  an  example  let 


A 


B 


Then  we  find 


0  0  0  1 
x  .  1  -2  4  6 

-  0  0  10 
0  1-4-4 


'  Proa  X,  we  compute 


t-[*  *1 

Le  12  -* 


He  return  now  to  lattices  to  introduce  the  concept  of  a 
greatest  common  sublattice.  Let  LA  and  Lp  be  two  lattices.  Then  the 
set  of  vectors  common  to  the  lattices  is  a  lattice  LQ  called  the 
greatest  common  sublattice  of  LA  and  Lp.  It  may  also  be  defined  as 
that  sublattice  of  LA  and  Lp  which  contains  every  conmon  sublattire 
of  LA  and  Lp.  It  is  interesting  to  ask  for  the  relation  between  the 
matrix  D  and  the  matrices  A  and  B. 


Theorem  3i  It  0,  is  the  lccm  of  A_  and  B.  then  Lp  is  the  greatest 
common  sublattice  of  LA  and  Lp. 


>roof i  LD  is  a  sublattice  of  LA,  thus  by  theorem  2,  D_  -  for  some 
integer  matrix  0.  Similarily  D_  ■  B_  \r  since  Lp  is  also  a  sublattice 
of  Lp.  Consequently  0_  is  a  common  right  multiple  of  A_  and  B.  Again, 
because  of  theorem  2,  Lc  is  a  sublattice  of  LA  and  Lp.  Therefore  Lg 
is  a  sublattice  of  Lp  sinca  Lp  contains  every  common  sublattice  of  LA 


2-W 
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w  '  r  - 


- '  -  '-wv-  ;.:v  *' 


f;  ’  •’  v  v  ^  ^  ?>r  ?  *jstf  ;;  & 


and  Lg.  Thus  £  •  j>  W  for  some  integer  matrix  W»  thia  finishes  the 
proof)  since  then  D_  satisfies  all  the  three  necessary  conditions  for 


a  lerm. 


atuences  Relative  to  «  Sublattice 


Definition i  Let  ja  and  J2.  b».-  two  vectors  belonging  to  a  lattice  LA. 
Let  Lg  be  a  sublattice  of  L^.  _n^  is  said  to  be  congruent  to  jn  modulo 


B,  written 


n  5  m  (modulo  B) 


if  (rv-m)  is  a  vector  belonging  to  Lg. 


This  relation  defines  a  set  of  equivalence  classes  called  the 
set  of  residues  modulo  B.  A  class  (nj  is 


JnJ  ■  ;m_  c  Lft  such  tha  m,  =  ji  (modulo  B) 


This  set  of  classes  is  denoted  w  Thus,  two  vectors  belong  to  the 
same  class  if  they  are  congruent. 

As  an  illustration  consider  Figure  4  where  LA  ■  Lj,  the 
lattice  of  integer  vectors.  s  l*  the  sublsttice  generated  by  the 
vectors  (2  0)'  and  (0  3)'.  Then  any  two  integer  vectors  that 
occupy  the  same  positions  inside  the  rectangles  are  congruent.  If  we 
choose  one  vector  from  each  class  we  get  a  set  of  representatives. 


a -so 
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Foe  tuiplii  all  Cha  heavy  dots  inside  the  first  rectangle  in  the 
first  quadrant  consitute  a  complete  aet  of  representatives.  Pigures 
S  and  6  illustrate  another  example  of  reeidue  classes.  In  the  next 
subsection  we  will  see  how  to  choose  a  useful  set  of  representatives. 


theorem  4i  Given  any  M  independent  vectors  belonging  to  a 
lattice  L  (the  (b^J  do  not  necessarily  constitute  a  basis  for  L), 
there  exist  vectors  C|i>..<e^  such  that 


£1  “  £n£i 

£2  *  £2l£l  *  —22-^2 


“  £Wi£i  *  •••  *  IVoAi 


where  the  jh^}  are  real  positive  numbers  which  satisfy 


0  «  hij  <  hii  '  1  <  j  <  i  <  M 


and  such  that  c_j,  c?, . . .  (c^  torn  a  basis  for  L.  Thus  £  ■  £  H  where  H_ 
is  an  upper  triangular  real  matrix. 


Equation  (II)  can  be  inverted  to  giver 


<^-57 


’  2.11  -i 

k2  -  a a,  £,  ♦  £22  £2 


£m  "  2«1  £1 


£m 


The  sketch  of  the  proof  can  be  inferred  from  the  following 
example.  Let  L  be  generated  by  (1,0)',  (1/2,  t/2)*.  Iet^.aj  be  the 
vectors  (5/2, 5/2)',  (-3,9)*.  How  can  we  find  Cj,  Cj? 

Let  t,  be  a  positive  real  number.  Consider  the  values  of  t, 
for  which  t^a,  belongs  to  L.  The  smallest  such  value  for  t,  is  1/5 
and  it  is  clear  that  any  such  t,  may  be  written  as  q,/5  where  g,  is 
any  integer.  Now  consider  the  values  of  t2  and  t3,  both  real  posi¬ 
tive  numbers,  for  which  t2  £,  ♦  t^aj  belongs  to  L.  The  smallest 
possible  value  for  tj  will  be  1/12  and  t2  might  equal  1/10  so  that  we 
get  the  vector  (0,1).  The  vectors  £,-(1/2, 1/2)  •  and  ^-(O.l)* 
obviously  form  a  basis  for  L.  Moreover  we  haves 


£l  ’  5  ±1 


£2  ’  To  £1  *  T7  £2 


and  thus 


£l  "  5  £) 


£2  “  “2  £1  *  12  £2 


P--5  3 
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The  method  of  proof  In  the  general  case  Is  the  same  and  will 
not  be  presented  here.  For  a  complete  proof  see  (25). 


Wain  Tbeoresi 

He  are  now  ready  to  present  the  nost  important  theorem  for  our 
purposes. 


Theorem  St  Let  Lg  be  a  sublattice  of  L^.  Then 

1)  the  number  of  different  residue  classes  modulo  B  is  a.  the 
index  of  l-A  in  Lg. 

2)  The  set  of  vectors  Uj  £j  ♦  u2  _Cj  +  •••  *  um^i  where 
c^,C2>...<  Cg  are  constructed  as  in  the  previous  theorem 
ard  where  u,,  Uj,...,!^  are  integers  satisfying: 

0  <  u,  <  g„ 


0  <  «L.  < 


constitute  a  representative  system  of  residue  classes 


As  an  example  consider  the  lattice  of  Figure  5  generated  by  N 
m  (Nj,  Nj)  where  ■  (4,  1)’  and  ■  (1,  2)’.  A  representative 
system  of  residue  classes  is  given,  as  predicted  by  the  theorem,  by 
ttf  ♦  UjC 2  where  Cj  -  (1,  2)',  c2  *  (',  ’)'#  u,  ■  0,  u2  ■ 
0,1,.. .,6,  (Figure  6). 
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Proof i  Suppose  {b^}  a  basis  for  Lg.  Then  as  in  theorem  4  we  con¬ 
struct  from  it  a  basis  for  LA.  Let  these  basis  vectors  for  LA 
be  Ic^l#  where  by  (12) 


-i 


hil*l 


4  hi2*2  4 


*  hi£i 


(13) 


Inverting  these  equations,  we  find  that  is  expressed  in  terms 
of  {£^)  by  a  triangular  matrix  as  follows: 


^i  “  «ii£i  +  9i2£2  ♦  +  9a£i  <u> 

where  q,,  *  — —  and  all  the  gi4  are  integers  since  c  is  a  basis  for 
Cii  1  1 

La.  Note  that  m,  the  index  of  LA  in  Lg,  is  the  determinant  of  the 

faijl  ■"«>»  therefore  since  the  matrix  is  triangular,  ">*9u922***9mm' 

He  now  set  up  a  representative  rystem  of  residue  classes 

modulo  B.  Consider  the  vectors 


ui£i 


*  ua£2  + 


♦  U  C 

PI  I  I 


(15) 


where 


0  <  ui  <  .  1  -  1.2 . M. 

There  are  exactly  9u922*’*9mm  "  *  such  vectors.  We  must  prove  any 
vector  in  LA  is  congruent  modulo  B  to  a  vector  of  the  form  (15)  and 
that  no  two  vectors  of  this  form  are  congruent  to  each  other.  Let  _x 


3- S’ 5 
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be  an  arbitrary  vactor  in  ao  that  ■  9^  *  g?c7  ♦  ...  ♦ 
where  tha  gt  ara  Integers.  Divide  gH  by  gm  and  get 

"  %*M  4  “M 

whera  0  <  u,,  <  gm.  Now  using  (14),  we  find  that 

2.  ■  qiA»  “  ’Ui  4  ’2*2  4  •••  4  Ci£m-i  4  Vh 


where  the  g^  are  integers.  Proceeding  in  the  sane  way,  we  can  re- 
duce  gH_j  and  so  on.  until  finally  we  have 


£  '  *,!>,  -  ~  S&  ’  u1-1  4  u2— 24  **  4  V* 


or  x  2  u,c,  ♦  u2£2  ♦  ...  ♦  uMc[t  nodulo  B. 

Suppose  two  of  the  vector  defined  by  (15)  lie  in  the  sane 
residue  class,  that  means  their  difference,  z,  lies  in  Lg.  Let  the 
coordinates  of  be  jk  where 


"*kk  <  3fc  <  *kk 


Suppose  jk  is  the  last  coordinate  not  sero.  Since  s.  is  in  Lg, 
it  is  an  integral  combination  of  bj,  and  therefore  the  kth 
^-coordinate  is  equal  to  or  greater  than  gkk.  This  contradicts  (16) 
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interchange*  roe*  1  and  3; 

"-1  0  •  •  o" 

0  1  •  •  • 

.  0  •  •  •  1  . 

Multiplies  roe  t  by  -1;  and 

t  u  0  •  0 
0  1  •  •  * 

0  •  •  •  1 

adds  u  times  row  2  to  row  1,  where  u  is  an  integer. 

The  above  matrices,  which  effect  elementary  operations,  are 
called  elementary  Matrices.  Elementary  column  operations  are  defined 
In  entirely  analogous  fashion  and  they  correspond  to  Multiplication 
of  £  on  the  r  ight. 

Ne  present  now  the  main  theorem  in  this  section. 

Theorem  5»  (Smith  Normal  Form) 

Every  M*M  integer  matrix  £  can  be  written  as 

N  •  0  D  V 


where  £  and  V_  are  unimodular  matrices  and  £  is  a  diagonal  matrix. 


a-6% 
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d,  0  *0 

0  ij  •  0 


such  that  d|,.'.'dH  are  nonzero  integers  and  d^  divides  di+,, 
1  <  i  <  m-1. 

Proof;  We  will  only  give  an  outline  of  the  proofs  a  bo  re  complete 
proof  can  be  found  in  {28].  We  will  illustrate  the  proof  with  an 
example  which  we  develop  as  we  go  along.  Let 


24 

-70 

-24 

-8 

40 

12 

-12 

36 

12 

We  may  assume  that  N_  «  (n.^j  contains  a  smallest  nonzero  element, 
which  may  be  brought  to  the  (1,1)  position,  namely,  to  the  first  line 
and  the  first  column,  by  suitable  row  and  column  interchanges.  Let  h 
represent  this  term.  In  our  example  h  •  -8,  and  by  interchanging  row 
one  and  row  two,  we  obtain: 


-8 

40 

12 

24 

-70 

-24 

-12 

36 

12 

We  replace  each  term  of  the  first  row  and  the  first  column  by  the 
remaining  (v^j  defined  as  follows 
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Ujj  and  u^|  being  integer  numbers.  For  instance 


40  «  -5x(-8)  ♦  0 
-12  -  1*  (-8)  ♦  (-4) 

This  amounts  to  subtracting  Ujj  tines  the  first  column  from 
each  of  the  columns  in  which  the  first  element  is  not  null.  This  is 
obtained  by  post  multiplying  by  an  appropriate  elementary  matrix.  It 
is  also  equivalent  to  substracting  u^,  times  the  first  row  from  each 
row  in  which  the  first  element  is  not  null.  He  obtain  this  by  pre¬ 
multiplying  with  an  elementary  matrix.  This  process  is  repeated 
until  all  the  elements  in  the  first  row  and  column  other  than  the 
(1f1)  element,  are  made  sero.  Denote  this  new  matrix  L_  -  {l^}.  In 
our  example,  we  obtain 


L  ■ 


0  4  ‘ 

50  0 

-24  0  . 


He  repeat  the  process  by  bringing  tt.e  element  4  in  the  (1,1) 
position.  Then 


(-4  0  °  -I 

L  -  I  0  0  SO 

LO  -4  -24  J 


Assume  that  the  submatrix  of  obtained  by  deleting  the  first 
row  and  column  contains  an  element  1^  which  is  not  divisible  by 
l)i«  Add  column  j  to  column  1.  Column  1  then  consists  of  the  ele¬ 
ments  l)|f  ^2j** ** '^ij* •••  Repeating  the  previous  process  we 
can  replace  l,f  by  a  proper  divisor  of  itself.  In  the  example  above, 
4  doesn't  divide  SO,  so  we  add  column  1  to  column  4.  He  have 


r  4  o  o  1 

L  -  50  0  SO 

~  L-2*  -4  -24  J 


which,  when  reduced,  leads  to 


pool 

t  -  0  0  12 

10  -4  576  J 


Thus  we  must  finally  reach  the  stage  where  the  element  in  the  (1,1) 
position  divides  every  element  of  the  matrix,  and  all  the  other  ele¬ 
ments  of  the  first  row  and  column  are  sero. 

The  entire  process  is  now  repeated  with  the  submatrix  obtained 
by  deleting  the  first  row  and  column.  Eventually,  a  stage  is  reached 
when  the  matrix  has  the  required  form.  For  our  case,  we  obtain 


Ji-bl 
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r*  °  °i 

0  ■  0  4  0 

'  LO  0  1 2 J 


Several  facts  about  this  fora  will  be  needed. 


a)  t>  is  unique  and  det(D)  ■  Idet  N_l  -  d1d2«...d|s. 


b)  This  fora  defines  an  equivalence  relation  in  the  space  of 
integer  matrices.  Two  matrices  are  equivalent  if  they 
have  the  same  normal  form.  7hu3  an  equivalence  class  will 
consist  of  all  the  matrices  whose  normal  forms  are  alike. 


c)  It  will  be  useful  for  us  to  know  the  number  of  equivalence 
classes  for  matrices  with  a  given  determinant.  This 
number,  which  will  be  used  for  classification  of  DFTs, 
will  be  calculated  in  a  latter  section,  after  we  give  some 
properties  of  the  numbers  {d.J.  These  quantities  are 
known  as  the  invariant  factors  of  N.  Thus  2  matrices  ara 
equivalent  if  and  only  if  they  have  the  same  invariant 
factors. 


Elementary  Divisions 

Let  tj,t2,...,t2  be  a  complete  set  of  primes  which  occur  as 
divisors  of  the  invariant  factors  {d^}.  Then  for  appropriate  nonne¬ 
gative  integers  e^j  we  have 


=3 -4a 
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so  on.  Thus  2  matrices  are  equivalent  if  and  only  if  they  have  the 
same  elementary  divisors. 

For  example,  suppose  the  4*4  matrix  _N  has  elementary 
divisors  {2,22,22,23,3,33,33,5,5,7}.  Then  d4  -  23*33*5‘7.  Deleting 
23,33,5,7,  we  have  the  set  {2,22,22,3,33,5}.  Then  d3  -  22*33*5. 
Deleting  22,33,5,  we  have  the  set  {2,22,3).  Then  dj  ■  22»3  and 
finally  dj  ■  2.  Thus  the  normal  form  of  N_  is 


22*3 


22,33*5 


23‘32*5*5 


The  next  theorem  is  useful  if  an  equivalent  diagonal  matrix  is 
known  or  if  the  given  matrix  itself  is  diagonal. 

Theorem  S»  Suppose  that  the  integer  matrix  N_  is  equivalent  to  a 
diagonal  matrix 


Then  the  prime  power  factors  of  the  b^,  1  <  i  <  a,  are  the  elementary 
divisors  of  N. 
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Proof t  Let  t  b«  any  prim*  which  divides  sone  b^,  1  <  1  <  a.  Arrange 
the  according  to  ascending  powers  of  t: 


bi1  "  ‘  *1 


b12-t  s2 


b,  *  t  sm 
in 


where  the  s^  are  relatively  prime  to  t,  and  0  <  e,  <  e2  <...<  e^. 

Then  clearly  the  exact  power  of  t  that  divides  dk,  t  <  k  <  m, 

%  «k 

is  t  .  Thus  t  is  an  elementary  divisor,  1  <  k  <  m.  Applying  this 
argument  for  all  primes  t  which  divide  some  b^,  we  obtain  the  result. 


As  an  example,  assume  N  is  equivalent  to 


then,  since  20  «  22*5,  6  -  2*3,  18  •  2*32,  the  elementary  divisors  of 
8^  ate  2,2,22,3,32,$.  Thus  the  Smith  Normal  form  of  N.  is 


e£?-65" 
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2  20  22  33  sl-P 


He  present  2  more  examples  that  will  be  considered  again  in  the  next 
chapter.  First,  the  matrices 


5] 


are  not  equivalent  since  they  have  different  prime  power  factors, 
namely  {3,3}  for  the  first  matrix  and  {3*}  for  the  second  one.  The 
other  example  is  that  of  the  matrices 

i:  :i  •  [i  a 


wher  p  and  q  are  relatively  prime  numbers.  Their  prime  powers  are 
the  union  of  the  prime  powers  of  p  and  the  prime  powers  of  q.  So 
those  2  matrices  have  the  same  prime  powers  and  hence  are  equiva¬ 
lent.  That  idea  was  exploited  by  Good's  mapping  theorem  (8]  in  the 
development  of  the  one-dimensional  Prime  Factor  Algorithm. 
HMltiplicatlvity  of  the  Smith  normal  Form 

Let  N  be  i  nonsingular  integer  matrix  such  that  «  P_(J,  where 
J?  and  <£  are  nonsingular  Integer  matrices.  Let  dk(N),  dk  ( P) ,  dk(Q) 
denote  the  kth  invariant  factors  of  N,  P_  and  Q,  1  <  k  <  m.  Then  the 
following  is  true. 
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Buotai  81  dk<N)  is  divisible  by  dfc(P)  and  by  dk(Q)  toe  1  <  k  <  a. 

The  proof  is  quite  involved  and  can  be  found  in  [28],  From 
this  theorea  we  easily  deduce, 

Iteores  9;  Suppose  the  detern inants  of  P_  and  <Q  are  relatively 
prime.  Then  the  normal  form  of  is  equal  to  the  product  of 

the  normal  form  of  _P  and  the  normal  form  of  Q. 

Proof s  Since  det<P)  and  det(Q)  are  relatively  prime  and  dk(jp) 
divides  det(P),  dk(Q)  divides  det(£),  it  follows  that  dk(P)  and  dk(Q) 
are  relatively  prime  for  1  <  k  <  n.  Then  the  previous  theorem 
implies  that  dk(N)  is  a  multiple  of  dk(P)  *dk(Q)  for  1  <  k  <  m.  But 
d,  (N)  *d2(Nl  ...dm(N)  -  det(N)  .  Thus  dk(N)  -  dk(P)  dk(Q)  .  This  com¬ 
pletes  the  proof. 

The  theorem  is  definitely  false  if  det(P)  and  det(Q)  are  not 
relatively  prime.  For  example  if 

»  rl  „  r  I  0, 

-  “  lO  s  1  '  SL  *  I  8  ‘ 

then  the  Smith  Normal  form  of  P  and  Q  is 


fl  0, 

l0  8 1 


cP-67 


\ 


\ 
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but 


2.  -  Z  2.  •  l. 


o  s  1 

■8  64 J 


whose  Smith  Normal  form  is 


r 

l0  8J 


while  the  product  of  the  normal  forms  of  P  and  Q  is 


t 


1 

0 


0 
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] 


The  Smith  Normal  norm  Class  Humber 

He  are  now  going  to  count  the  number  s(t)  of  equivalence 
classes  of  mmm  matrices  of  fixed  nonzero  determinant  t.  Let 


*1  *2 
t  -  t%\2. 


be  the  prime  power  decomposition  of  t.  Then  s(t)  is  just  the  number 
of  ways  of  forming 


so 


*n  *ij  *u 

a,  (K)  -  t,  t2  ...tl 


*21  *22  *21 
<Ja<2t>  -  t,  t2 


e  ,  e  ,  e  , 
ml  m2  ml 

V-1  “  fc1  t2  ‘••tl 


where  the  nonnegative  integers  e^  Satisfy 


°  ‘  *»j  *  *2j  ‘  •**  *  *mj  * 
*1j  *  *2  j  *  •“  *  *«J  "  *j 


(IS) 


For  each  j  such  that  1  <  j  <1,  equation  (18)  has  p  (e^.m)  solutions 
where  p(ej,m)  is  the  number  of  partitions  of  m  into  parts  not  exceed¬ 
ing  Cj,  or  equivalently,  into  at  most  e^  parts.  Thus 


Mt)  -  P(«1  ,ra)  •  p(e2,»)  ...  p(e1#m) 

As  an  illustration,  let  t  -  1«  and  m  ■  2.  Then  t  ■  2*  and  so  e,  « 
4.  Thus 


s(t)  -  p(4, 2)  -  J 


CHAPTER  HI 


MDUIDDOEHSIOHAL  ITT  ALOORITOtS 

The  Mathematical  concepts  and  results  discussed  in  the 
previous  chapter  will  be  used  here  to  derive  some  new  results.  As 
already  pointed  out  in  the  introduction,  one  significant  contribution 
of  our  work  is  the  formulation  of  a  mathematical  context  that  permits 
a  better  understanding  of  the  qeneral  multidimensional  DFT  (MDFT) . 
More  specifically,  it  will  be  seen  that  the  indices  of  the  multi¬ 
dimensional  DPT  can  be  regarded  as  members  of  a  lattice  structure. 
Also,  residue  classes  for  lattices  are  seen  as  a  mean  for  operating 
on  these  indices.  Prom  the  insight  gained,  a  host  of  new  results  can 
be  developed.  Our  aim,  in  this  chapter,  is  to  demonstrate  this  by 
deriving  a  set  of  new  algorithms  for  the  evaluation  of  MDFTs.  First, 
we  present  in  section  one  an  indirect  method  which  transforms  a 
general  MDPT  into  a  rectangular  DFT  that  can  be  evaluated  in  a 
conventional  manner.  Apart  from  its  obvious  practical  usefulness, 
this  method  has  a  significant  theoretical  importance.  Indeed,  it  is 
used  to  demonstrate  that  the  MDFTs  can  be  compared  in  terms  of  the 
number  of  multiplications.  In  addition,  using  the  discussion  in 
section  two  of  the  previous  chapter  on  Smith  Normal  forms,  a  classi¬ 
fication  of  MDFTs  according  to  both  length  and  form  is  provided. 
This  classif ication,  in  its  turn,  will  have  a  great  impact  on  the 
design  of  practical  algorithms. 


It  is  expected  that  a  more  direct  approach  to  the  problem  can 
make  better  use  of  the  mathematical  ideas  discussed  in  Chapter  II. 
Our  aim  is  to  generalise  directly  the  methods  used  in  evaluating 
one-dimensional  DFTs.  Most  of  today's  efficient  one-dimensional  FFT 
algorithms  are  based  on  a  theorem,  in  number  theory,  called  the 
Chinese  Remainder  theorem  (CRT)  for  integers.  An  equivalent  theorem 
for  integer  vectors  is  needed  to  be  able  to  develop  similar 
algorithms  for  MDFTs.  After  an  intensive  search  in  the  mathematical 
literature,  no  such  theorem  was  uncovered.  The  most  general  form  of 
the  Chinese  Remainder  theorem  was  found  to  exist  for  commutative 
modules  [29].  Unfortunately,  in  our  case,  lattices  are  noncommu- 
tative  modules.  Thus  we  have  been  led  to  prove,  ourselves,  a  Chinese 
Remainder  for  lattices.  A  formulation  and  a  complete  proof  is  given 
in  section  two. 

In  section  three  we  get  to  the  core  of  the  problem  which  is 
the  design  of  FFT  algorithms.  Using  the  theorem  from  section  three, 
we  derive  a  general  class  of  FFT  algorithms  for  the  MDFT.  The  al¬ 
gorithms  differ  in  the  manner  with  which  the  DFT  modules  are 
nested.  In  particular,  a  matrix  Prime  Factor  algorithm  (MPFA)  and  a 
matrix  Winograd  Fourier  Transform  algorithm  (KWFTA)  are  described  in 
section  four.  As  in  the  one-dimensional  case,  the  factorization  of 
the  periodicity  matrix  constitutes  an  important  step  in  the  design 
process. 

One  of  the  main  disadvantages  of  the  KDFT  is  the  necessity  to 
keep  track  of  all  the  indices  explicitly.  In  the  rectangular  DFT, 
both  the  frequency  and  time  indices  are  implicitly  known  if  the 
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length  of  the  OFT,  in  each  dimension,  la  known.  In  section  five,  e 
method  is  described  which  peraits  the  description  of  the  indices  of  a 
general  ME  FT  in  rectangular  fora.  This  method  will  help  in  the  de¬ 
sign  of  efficient  DPT  modules. 

A  complete  example  is  given  in  section  four  which  clearly 
illustrates  the  inner  details  of  the  MPFA.  The  example  also  serves 
as  an  introduction  to  the  next  chapter,  which  addresses  the  practical 
side  of  the  dissertation. 

The  BPV  Algorithm  [301,  131) 

Many  FFT  algorithms  have  been  developed  for  the  evaluation  of 
rectangular  MDFTs.  In  this  section,  we  derive  a  procedure  which 
permits  the  use  of  these  algorithms  for  general  MDFTs.  As  we 
mentioned  before,  the  procedure  will  have  both  practical  and 
theoretical  usefulness.  On  the  practical  side  it  provides  the 
ability  to  use  existing  software  and  hardware  facilities  to  Implement 
the  program.  Its  main  theoretical  contribution  is  in  showing  how 
MDFTs  can  be  compared  in  terms  of  multiplicative  complexity.  The 
approach  results  in  a  classification  of  MDFTs  which  has  further 
practical  usefulness  in  the  design  of  algorithms. 

He  start  by  considering  a  matrix-N_  DPT  as  defined  in  Chapter 
It 

X(k)  -  l  x(n)  exp[-J2vkTH*1rt) 

-CIH  (19) 

T 

-  '  rN  ' 
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whs re  x(_n)  is  the  input  sequence,  with  finite  support  on  Ij,.  He  will 
make  cleer ,  shortly,  that  this  definition  unduly  restricts  the  values 
that  £  and  k^  can  take.  It  will,  perhaps,  be  of  interest  to  let  the 
domain  of  n_  and  k^  be  as  large  as  possible  with  the  only  restriction 
being  that  the  DPT  values  resain  unchanged'.  Recall  froa  the  deriva¬ 
tion  of  (19)  that  x(n)  is  periodically  extended,  at  least 
conceptually,  to  fors  a  multidimensional  periodic  sequence,  with 
period  N.  N_  defines  a  lattice  L^,  as  explained  in  Chapter  II,  and 
the  set  of  indices  ne  1^  is  then  a  set  of  representatives  for  the 
residue  classes  Therefore,  we  can  replace  the  set  of  jvci^ 

with  any  set  of  representatives,  with  no  effect  on  the  final  values 
of  the  MDFT  defined  in  (19).  Similarly,  the  frequency  indices  k  form 
a  set  of  representatives  for 

In  this  light,  a  better  definition  for  a  matrix-N  DfT  would  be 


l 

-CtI/1 J 


x(n)  exp[-j2tkTN-1n) 


-  C  LI/H 


In  this  formula  there  are  no  restrictions  on  the  indices  _n  and 
_k_  except  that  they  must  form  sets  of  inequivalent  elements  with 
respect  to  lattices  and  I^T  respectively. 

Next,  if  N_  is  nondiagonal,  we  have  seen  in  Chapter  II  that  we 
can  write  it  in  Smith  Normal  form  as 


N  •  0  D  V 


-2-7V 
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where  £  le  an  Integer  diagonal  matrix. 


where  dj  divides  d^j,  i  •  1,...,m-1,  and  U_  and  V  are  uninodular 


matrices. 


Substituting  (21)  into  (20)  the  KDPT  summation  becomes 


X(k)  *  l  x(n)  expt- j2xkTv"1D“1o'1n) 

*Ll/»V 


*eLi/n 


Let  us  define  new  integer  variaolest 


*  -1 
n  *  0  n 


*  -IT 
k  -  (V  )  k 


We  claim  that  the  sets  {£}  and  {£)  form  legitimate  sets  of 
representatives  for  the  residue  classes  Lj^j.  Moreover,  since  tJ-1 

a  ■  A  A 

and  (V  )T  are  unimodular,  the  sequence  x(n)  (x  (Jk> )  is  simply  a 
reindexing  of  the  samples  of  x(_n)  (x  (JO ).  The  claim  is  proved  by  the 
intermediary  of  the  following  theorem. 


a-if 
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Theorem  9t  Let  *  and  B_  be  nonsingular  integer  matrices,  and  let  the 
product  A,  lj/B  denote  the  aet  of  elements  obtained  by  Multiplying 
elements  of  LI/1  by  A.  Then  the  following  equality  of  aeta  is  true: 


— ^1/B  “  S/AB 


Proof:  He  start  by  selecting  a  special  set  of  representatives  for 

the  residue  classes.  To  do  this,  let  us  define  R  as  the  set  of 
rational  vectors  i_  all  of  whose  coordinates  r ^  satisfy  the  condition 
0  <  r4  <  1,  i  ■  Then,  it  can  be  proved  (32),  that  the 

set  (integers  J>  -uch  that  Bf^,b  e  r)  fora  a  set  of  representatives  for 
Lj^g.  Geometrically,  this  set  consists  of  all  the  integer  vectors 
contained  inside  the  ptrallelopiped  defined  from  the  columns  of  B. 

The  equality  (25)  is  proved  by  showing  set  inclusion  in  both 
ways.  Let  ji_  be  an  element  of  -  i.e.  there  exists  a  vector 

b_  C  Lj^b  such  that  J>.  Prom  our  above  choice  of 

representatives,  we  Slave  that  Bj^b.  £  R.  But  £  »  a” '_n»  thus 
B-1A-,|«_  c  ?.  This,  in  its  turn,  implies  that  _n  e  B*  Thstefore, 

we  have  just  proved  that  A.  **1/8  ®  B"  **°w'  we  *et  —  e  B' 

then  B-,A-1jj_  e  R,  which  is  equivalent  to  saying  that  A”’jn_  c 

Define  a  new  vector  _b  as  Js  ■  A__1n_,  or  as  _n  ■  A__b.  Since  _b  e  Lj^g, 

then  _n  e  ALj^,fl  by  definition  of  the  set  ALj^g.  Therefore  b  c 

H'l/B'  Combined  with  the  first  part  of  the  proof,  this  proves  that 
the  sets  are  equal. 


a-ntp 
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We  now  use  this  thaorea  to  prove  the  claim  we  have  made  about 

equations  (23)  and  (24) .  Note  that  since  c  L^,, 

*  —1  —1 
then  n  c  0_  Lj^u*  But,  applying  the  above  theorem,  0_  Ljy^  ■ 

In  addition,  recalling  that  N_  ■  0_  D__V,  we  have  that  • 

iT’Vj  D.  ■  D_  V.  Therefore,  n  e  i*j/D  y  We  recall,  from  the  presen¬ 
tation  of  lattice  theory  in  Chapter  II,  that  D_  and  JJ  are  bases  for 
the  same  lattice  since  V_  is  unimodular.  Thus  m  Lz/o  v  and’ 

consequently,  n  e  Lj^j. 

Similarly,  since  D_  is  diagonal,  _D  ■  t?  and  thus  Jc  t  « 

Ll/tr 

In  view  of  the  preceding  discussion,  equation  (22)  can  be 

written 


X(k) 


x(n)  expl-j2w£rD”,nJ 


nel,  J9\ 

—  I/t> 


(26) 


k  c  L.  _ 
-  I/D 


Tills  sum  is  seen  to  be  a  matrix-D.  OPT.  And,  J>  being  a  dia¬ 
gonal  matrix,  we  have  obtained  a  rectangular  DPT.  This  decomposition 
provides  the  following  algorithm  for  evaluating  a  matrix-N_ DPT: 

-  Express  N  in  Smith  Normal  form  as  N  ■  0  D  V. 


-  Scr  table  ilia  input  sequence  according  to  the 

*  _i 

relation  n  ■  £  n. 

-  Compute  a  rectangular  DPT  of  the  resulting  aequence  using 
the  diagonal  periodicity  matrix  d. 

-  Unscramble  the  output  sequence  according  to  the 
relation  Jr  »  V  'l. 

Several  observations  can  be  derived  from  this  algorithm. 
First,  the  reindexings  of  the  second  and  fourth  steps  involve  no 
multiplications.  Therefore,  the  matrix-£  DPT  and  the  matrix-£  DPT 
possess  the  same  multiplicative  complexity.  Second,  the  rectangular 
OFT  will  itself,  in  general,  require  data  shuffling.  The  latter  can 
be  combined  with  the  shuffling  present  in  steps  2  and  4,  and, 
therefore,  it  is  to  be  expected  that  the  number  of  additions  and  data 
shuffling  for  both  DPT's  would  be  comparable. 

As  an  example,  consider  the  evaluation  of  an  MDPT  with 
periodicity  matrix  £  given  by 


«-(22  5) 


(27) 


Assume  that  the  input  samples  are  available  as  a  2  by  4  rectangular 
array  and  that  the  output  samples  are  required  to  have  the  same 


’*  *  *..*  '•  •»*  ,  .  .* v. • ,  *  ,  !  -*•  f  '  .  f  . 

♦  •  -"/,  /  ■  •-.«/'  '  ,,v  .•  -  •  <■,  : 
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format .  Thu*  th«  residue  cl ••■«*  for  both  the  input  end  output  are 
fixed  and  are  given  by 


I/M 


i(0,0>T,(1/0)T,(0,1)T,{1,1)T,(0,2)T, 

(1,2)T,(0,3)T,(1,3)T} 


(28) 


and  are  shown  in  Figure  7.  This  choice  was  made  for  the  purpose  of 


comparison  between  the  MDFT  and  a  2  by  4  rectangular  DFT. 
The  Smith  Normal  factorization  of  N  is  found  to  be 


N 


L 


i 

i 


(29) 


Thus,  the  UDV  algorithm  results  in  a  2  by  4  rectangular  DFT.  In 
Figure  B,  we  show  the  flowchart  of  the  2  by  4  rectangular  DFT  of  the 
input  samples  x(n).  The  flowchart  is  derived  by  combining  the  flow¬ 
charts  of  two-point  and  four-point  one-dimensional  DFT's.  It 
represents  a  row-column  decomposition  of  the  signals.  The  rows  are 
2-polnts  long  and  the  columns  4-points  long.  For  our  purposes  the 
flowchart  is  decomposed  in  3  parts.  The  first  part  consists  of  the 
input  indexing,  the  second  part  contains  the  additions  and  multipli¬ 
cations  steps  and  the  last  step  represents  the  output  unscrambling. 
Figure  9  illustrates  the  natix-N^  DFT  as  evaluated  by  the  U_  D_ 
algorithm.  In  that  figure  we  have  combined  the  indexings  due  to  the 
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X(O.O) 

xn.oi 

X(O.l) 

X(1.1) 

X(02> 

XIU) 

X(0,3) 

xn.3i 


£  and  £  matrices  with  the  input  and  output  indexing*  of  the  matrix-D_ 
DPT,  respectively.  A  comparison  with  Figure  8  clearly  shows  that  the 
DPTs  differ  only  in  the  indexing  parts.  Thus  the  number  of  opera¬ 
tions  is  the  same  for  both  DPTs. 

This  discussion  lead  to  the  important  conclusion  that  the  com¬ 
putational  complexity  of  the  matrix-N.  DPT  is  completely  determined  by 
the  Smith  Normal  form  of  N.  And  thus,  results  in  complexity  theory 
for  rectangular  DPTs  (12)  can  be  readily  applied  to  matrix-DFTs  in 
general. 

Recall  from  Chapter  II  that  there  are  a  finite  number  of  in¬ 
equivalent  matrices  £  with  a  given  determinant  n.  Equation  (18) 
gives  that  number  as  a  function  of  the  determinant  whose  value  is 
also  the  number  of  data  samples.  A  procedure  which  provides  all  of 
those  inequivalent  normal  forms  was  also  presented.  Thus,  given  the 
number  of  data  points  n,  one  has  only  to  examine  a  finite  set  of 
rectangular  DPT's  to  have  a  complete  character ixat ion  of  the  n- point 
matrix-DFTs. 

To  illustrate  this  fact,  consider  again  the  example  provided 
in  section  four  of  Chapter  II.  It  was  found  that  for  a  number  of 
data  samples  of  16,  there  are  3  inequivalent  normal  DPTs  with 
periodicity  matricesi 
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Dj  correspond*  to  •  16-point  one-dimensional  DFT,  D?  to  «  2  by  8 
rectangular  tvo-d lmena ional  DfT  and  Dj  to  a  4  by  4  rectangular  two- 
dimensional  EFT.  Moreover,  any  16-point  two-dimensional  matrix-N_DFT 
will  decompose  into  one  of  these  3  OFT*. 

It  can  be  shown  (33]  that  the  natrlx-Dj  DFT  may  be  evaluated 
•ore  efficiently  than  the  astr  ix-P;  DFT.  At  the  sane  time,  the 
latter  is  more  efficient  than  the  matrix-^  DFT.  Therefore,  there 
exists  a  total  ordering  of  the  DFTs  in  terns  of  computational 
complexity  and  this  ordering  can  be  deduced  by  looking  at  the  class 
of  normal  forms. 

TO  prove  the  last  point,  consider,  for  the  sake  of  simplicity 
and  with  no  loss  of  generality,  two  two-dimensional  normal  forms, 
with  determinants: 


*  -[?'  u 


Assume  also  that  d^  <  d2, .  Then  since,  the  determinants  are  equal 


A  il  m  A  <J 
11  12  2122 
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By  the  property  of  normal  forma,  m»  know  that  divides  d,2  and  dj( 
divide*  djj.  Thua,  It  ia  aaay  to  ahow  that  d,j  divide*  d2,,  i.a. 
thara  exists  an  Integer  k  auch  that 


■  k  d 
-  fc  d 


II 

22 


No  substitute  (33)  into  (31)  to  get 


Moreover,  aince  d22  divides  d21,  we  have 


“22 


•id 


21 


(33) 


(34) 


(35) 


for  acme  integer  1.  Combining  (35)  and  (33)  we  obtain 


d„  -  ki  d„ 


(36)  and  (34)  lead  to 


(36) 


(37) 
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The  nest  step  la  to  show  that  matrix-D?  DFTs  can  ba  evaluated  In  a 
■oca  efficient  aannar  than  aatrix-D^  DFTs .  He  will  do  so,  in  a  some- 
what  hautistic  manner,  by  factoring  O,  and  Dj  ast 


-1 

'*11 

Lo 

u 

i: 

a 

°2  “1 

[*ii 

0 1 

°i 

l-o 

Lo 

k  J 
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The  two  matrices  differ  by  their  second  'actor.  The  factors,  if  used 
in  a  algorithm  such  as  the  matrix  Cooley-Tukey  algorithm  or  in  any 
algorithm  that  will  be  developed  in  the  next  sections,  will  result  in 
DFTs  with  different  computational  complexity.  It  is  known  [33],  that 
a  rectangular  k  by  k  2-0  OFT  is  more  efficient  than  a  k2-point  1-0 
DF7.  Therefore  the  mattix-D^  OFT  can  be  evaluated  more  efficiently 
than  the  matrix-D^  DPT. 

He  summarize  the  above  discussion  by  saying  that  the  MDFTs  can 
be  classified,  in  terms  of  computational  complexity,  both  by  length 
and  by  form.  The  length  refers  to  the  number  of  data  samples  and  the 
form  corresponds  to  the  shape  of  the  Smith  Normal  form.  For  a  given 
length,  there  are  a  known  finite  number  of  forms  and  these  can  be 
totally  ordered  according  to  computational  efficiency.  The  ordering 
is  accomplished  by  examining  the  invariant  factors.  More 
specifically,  it  is  done  by  looking  at  the  first  non-equal  invariant 
factors,  starting  from  the  top  rows  of  the  matrices.  The  form  cor- 


responding  to  the  lergeet  lneer lent  factor  result  In  a  do re  efficient 
algorithm*  aa  explained  in  the  previous  example.  An  intuitive 
explanation  is  that  this  fore  is  somewhat  more  balanced. 


Chinese  Remainder  Theorem  for  Lattices 
In  this  section  we  state  and  prove  a  purely  mathematical  re¬ 
sult  which  will  be  used  in  subsequent  sections.  The  Chinese 

Remainder  theorem  for  integers  (CRT)  played  a  critical  part  in  the 
development  of  one-dimensional  PFT  algorithms.  When  the  modulus  n  of 
a  congruence  is  composite,  this  theorem  helps  reduce  a  congruence 
modulo  n  to  a  system  of  smaller  congruences  (namely,  congruences  with 
respect  to  the  factors  of  n).  We  begin  by  stating,  without  proof, 
the  theorem  in  its  simplest  form. 


Theorem  (Chinese  Remainder  theorem  for  integers)  122] t  Assume  that 
where  Sj  and  Hj  are  positive  integers  and  assume,  in  addition, 
that  (N|,M2)  *  1.  Let  p  and  q  be  any  integers  and  consider  the 
system  of  congruences 


n  S  p  (mod  Wj) 
n  5  q  (mod  dj) 

Then  this  system  always  has  solutions  and  any  two  solutions  differ  by 
a  multiple  of  H, 


To  illustrate  the  use  of  this  theorem  for  DFTs  consider  a  1-D 
DPT  of  length  N.  The  theorem  may  be  interpreted  to  imply  that  there 
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la  a  one-to-one  relationship  between  th«  act  of  sample  indices  n, 
defined  modulo  H,  and  the  pair  of  indices  (p,q)  defined  modulo  Nj  and 
Mjt  respectively.  It  is  this  re indexing  of  the  sequences  into  higher 
dimensions  that  is  at  the  core  of  the  savings  of  various  ITT  al¬ 
gorithms. 

The  theorem  is  capable  of  vast  generalizations.  Properly 
formulated,  it  holds  in  any  ring  with  identity.  The  most  general 
form  we  have  encountered  in  our  literature  search,  occurs  in  commu¬ 
tative  modules  (vector  spaces  over  rings).  Unfortunately,  our 
interest  lies  with  a  noncommutative  module,  namely  the  lattice  of 
integers.  There  are  no  generalization  of  the  CRT  for  noncoosutative 
nodules,  most  probably  because  each  one  would  constitute  a  fairly 
particular  case. 

We  wish  to  formulate  and  prove  a  similar  theorem  from  a 
lattice  point  of  view.  We  begin  by  presenting  a  useful  lease  con¬ 
cerning  factorization  of  matrices.  Assume  II  is  a  composite  matrix 
whose  determinant  is  the  product  of  two  relatively  prime  numbers  p 
and  q. 


lemma i  There  exists  matrices  P^,  P?t  0,  and  Qj  such  that 

”  *  »ifl|  "  2a*2  <40> 
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trooti  Osing  the  results  from  section  four ,  write  N.  In  Smith  Normal 
form, 

N  -  0  D  V  (41) 

D_  Is  diagonal  and  Idet  D|  *  pq.  it  is  fairly  straightforward  to  show 
that  D  can  be  factored  as 


£  “  £,  (42) 

where  a,  and  are  both  diagonal  and  Idet  l>jl  •  p,  Idet  D^l  •  q.  we 
insert  (42)  into  (41)  to  obtain 

£*££|£2£  (43) 

And,  since  Pt  D?  ■  D?  JD, ,  (43)  can  also  be  written  as 


»  -  0  £j  £,  V 

The  lemma  is  then  verified  by  putting 


(44) 


a, 

52 


"££, 

•  £2a 

-  £,  V 

•  Ej  £ 


(45) 
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Notice  that  tha  preceding  proof  la  a  constructive  one. 
Moreover,  it  ia  possible  tc  write  a  prograa  that  performs  the 
factorizations,  if  (4S)  ia  uaed.  Mote  also  that  the  factorization  ia 
unique  only  up  to  unimodular  eatrieea,  for 


*  -  (P,B)  (jf'fi,) 


ia  another  acceptable  factorization,  for  any  unimodular  matrix  E. 

Now,  given  the  factorization  (40),  let  Lj^,  •  l»i^  be 
the  residue  classes  as  defined  in  Chapter  II.  The  direct  sum  of  the 
2  sets,  ♦  Lj^,  is  defined  to  be  the  set  of  couples  (j>,  £) 
with  2.  c  Lx_/p)  and  In  thi*  *et'  addition  of  vector  and 
multiplication  of  a  vector  by  a  matrix  A_  are  defined  by 


(£,,  a,)  ♦  <£2,  a2>  “  <£,  ♦  £2’*  3)  4  V 

mp»  a>  -<*£'*  (46j 

Theorem  11  (Chinese  Remainder  theorem  for  lattices}!  Lj^  is  iso¬ 
morphic  to  the  direct  sun 

Prooft  The  proof  consists  of  providing  a  map  f  from  Liyn  to  th* 
direct  sum,  which  is,  first,  a  group  honoaorphism,  and  second. 


St-90 
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one- to- on*  an<i  or.').  L»t  us  d*( in*  p  as  follows: 

Tin)  «{£,£> 


where 


(«7) 


£  =  £  (SOd  ) 
£  =  n  (mod  Q2) 


It  is  straightforward  to  check  that  if  Fiji])  ■  (_£j »  £, )  and  F(_nj)  * 
(£2#  £3*  then  "  <£j'*Ea*  Sj-*32>.  Also,  we  have  P(h  n)  - 

(A_j >,  A  _g) .  Thus,  F  satisfies  the  requirements  to  be  a  group  homo¬ 
morphism. 

The  proof  of  the  one-to-one  property  is  much  more  involved  and 
requires  some  of  the  theorems  presented  in  Chapter  II.  To  be  more 
specific,  we  need  to  show,  since  the  spaces  are  linear,  that 
P(ni)  •  (_0,  OJ  if  and  only  i f  _n_  ”  _0_.  Stated  in  another  way,  the  fol¬ 
lowing  must  be  true:  _n_  e  if  and  only  if  _n  c  1^  and  _n_  c  L^. 
Translated  in  set  rotation,  it  must  be  true  that  *  Lp  n  Lq  . 

Notice  that  in  (40),  N_  is  written  as  a  right  multiple  of  both 
Fj  and  Q?.  Then,  theorem  2  in  Chapter  II  implies  that  1^,  is  a  sub- 
lattice  of  both  and  1^.  In  other  words,  lN  is  contained  in 

%  "  V 

let  tjj  denote  the  greatest  common  sublattice  of  Lp1  and 
i  e  t  m  lp  n  In  the  previous  paragraphs,  we  have  shown  that 

C  Ljj.  Theorem  3,  in  Chapter  II,  implies  that  is  the  lcrm  of  P| 
and  Qj.  Then,  since  N_  is  a  right  multiple  of  P_,  and  Q^,  and  by 


2-V 


definition  of  the  lorn,  M  is  i  right  multiple  of  D.  Therefore,  there 
exists  a  matrix  S_  such  tha  N_  »  D_  S_.  Then,  by  the  property  of 


determinants. 


Idet  N|  >  |det  D| 


On  the  other  hand,  Idet  DJ  is  the  1cm  of  Idet  P_,l  and  Idet 
Q?l .  But  Idet  Pj  I  «  p,  Idet  Qjl  »  q  and  p  and  q  are  relatively 
prime.  This  clearly  implies 


Idet  £  I  •  pq  «  n 


Idet  D  I  ■  Idet  N 


(49)  combined  with  the  fact  that  C  Lp,  implies  that  Ljj  «  Lp,  which 
proves  the  one-to-one  property. 

Now,  let's  examine  the  onto  property  of  the  map  F.  In  our 
case  this  property  is  automatic.  Indeed,  the  equality  and  finiteness 
of  the  number  of  elements  in  our  two  sets,  combined  with  the  one-to- 
one  property,  imply  the  onto  property. 


He  take  the  transpose  of  N_  in  (40)  to  obtain 


Jt  _  T  T  _  T_  T 

‘  h  Sa 
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Applying  the  Chinese  Remainder  theorem  to  NT  yields  a  formula 
that  will  be  equally  useful  in  subsequent  sections. 


(51) 


As  an  application  of  the  CRT  theorem  to  the  MDFT,  we  briefly 
state  that  the  signal  domain  index  _n_  can  be  mapped  into  a  pair  of 
indices  {n_,,  n^)  without  altering  the  values  the  MDFT  takes. 
Similarly,  the  frequency  index  _k_  is  also  mapped  into  a  pair  of 
indices,  (kj,  k2) .  These  transformations  form  the  basis  of  the  tech¬ 
niques  that  will  be  developed  for  the  efficient  evaluation  of  MDFTs. 

As  in  the  1-0  case,  the  mapping  F  is  not  unique.  Choosing  the 
right  indexing  scheme,  i.e.  selecting  F,  will  become  a  significant 
part  of  the  construction  of  a  fast  algorithm.  Burrus  (34)  was  able 
to  derive  the  general  form  of  the  mappings  F,  in  the  1-0  case. 


Prime  Factor  Algorithm 

This  section  addresses  the  central  problem  of  developing 
practical,  fast  and  efficient  algorithms  for  the  evaluation  of 
MDFTs.  We  describe,  here,  a  method  which  is  a  generalization  of  the 
1-D  Prime  Factor  algorithm  (PFA)  (9).  The  idea  behind  the  technique 
is  the  same  as  for  its  1-D  counterpart:  compute  long  transforms  by 
combining  a  set  of  short  length  transforms.  The  Cooley-Tukey  al¬ 
gorithm  uses  the  same  principle,  but  now  the  lengths  of  the  short 
OPTs  are  required  to  be  mutually  prime.  The  primeness  condition, 
together  with  the  results  discussed  in  previous  sections,  allcws  the 


establishment  of  new  algorithms 
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A  detailed  description  of  the  algorithm  is  given  next.  The 
description  involves  two  factors  only,  for  clarity,  but  the  technique 
can  be  straightforwardly  generalized  to  include  more  than  two 
factors.  The  algorithm  will  be  called  the  Matrix  Prime  Factor  al¬ 
gorithm  (MPFA). 

Consider  an  KDFT  with  periodicity  matrix  N: 


X(k)  “  l  x(n)  exp{-j2*k_TN  1  n_) 


HeLl/M 


(52) 


ieLi/« 


Assume  that  the  length  of  the  KDFT,  Idet  KJ ,  is  the  product  of  2 
relatively  prime  integers,  p  and  q.  p  and  q  represent  the  lengths  of 
the  short  DFTs  which  are  going  to  be  combined.  Multidimensional 
index  maps  will  be  used  to  convert  couples  of  indices  into  the 
indices  used  to  access  the  input  and  output  data  arrays. 

As  we  mentioned  in  the  previous  section,  N_  can  be  factored  as 


>i2, 


(53) 


with 


Idet  P? I  -  Idet  Pjl  ■  p 
Idet  Q,  I  ■  Idet  Q„l  -  q 

— 1  — A 


The  Chinese  Remainder  theorem  we  have  just  described  asserts  tha 


*20. 


is  isomorphic  to  the  direct  sume  ^i/p^  *  *  ctuc*al  step  in 


<p-?v 


-•y .  w .,m<>»i^  »»«>;»*»  aa<lpaaW»W~<M ■  <PH««IU*HWmW» 


the  design  is  how  to  select  the  isomorphism.  The  isomorphic  nap  has 
to  be  a  one-to-one  and  onto  correspondence  between  the  collection  of 
indices  (jv,,  i^)»  where  _n,  belongs  to  and  JJj  to  and  the 
index  _n_  in  L^^,.  Denoting  the  nap  ?(n_, ,  n?) ,  any  sum  at  ion  over  the 
simple  index  _n_  is  replaced  by  a  multiple  sun  over  the  variables  jv,, 
n?,  if  n_  is  replaced  everywhere  by  F(n_,,  ji^) .  We  nay,  then,  define  a 
new  multidimensional  array  by 


yin,,  n2?  »  x(f{n1  ,n2>) 


and  y  is  substituted  for  x  when  the  simple  summation  is  replaced  by  a 
multiple  sum.  But,  it  is  often  convenient  to  substitute  the  letter  x 
for  y  and  no  confusion  should  result  since  x  and  y  have  different 
arguments. 

Similarly,  a  second  map  should  be  produced  for  the  output 
indices.  It  is  a  one-to-one  correspondence  between  the  collection  of 

indices  (Jr,,  J^J ,  where  Jr,  belongs  to  and  to  /Q,*'  and 
T 

the  index  k_  in  .  Denoting  the  map  as  G(k_, ,  k?) ,  the  output  X(JO 
is  recovered  from  the  multidimensional  output  array  X(Jtj,  _kj)  by 
mapping  backwards  through  the  map  G.  The  exact  equivalence  is 


X(k)  -  X(<f\k,,  k2() 


Due  to  the  particular  forms  of  the  maps  F  and  G  and  their 
interactions  with  the  indices,  the  kernel  of  the  resulting  multiple 
summation  will  be  separable,  yielding  a  great  savings  in  computa- 
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tional  effort.  The  success  of  the  MPPA  method  stems  from  this 
separability. 

In  this  section,  the  maps  P  arid  6  are  chosen  to  be  different, 
for  demonstration  purposes.  In  the  next  chapter,  in  our  actual  im¬ 
plementation  of  the  algorithm,  practical  considerations  lead  us  to 
select  identical  maps  for  P  and  G. 

Let  us  select  the  map,  _n  »  P(n^,  to  be: 


n  =  Q2£_1n,  ♦  P,  3-1n2  (nod  N) 


where  _U  is  the  unimodular  matrix  derived  when  writing  N_  in  Smith 
Normal  form  as  N_  ■  0_  l>  V. 

We  need  to  prove  the  one-to-one  property  of  the  map.  Because 
Qj  and  P_,  have  relatively  prime  determinants,  we  may  assume,  with  no 
loss  of  generality,  that  _n-j  =  J2.  (nod  Qj) .  We  want  to  show  that  if  _n 
satisfies  both 


n  s  Q2U  'n,  (mod  N) 


n  =  5j£  m,  (mod  N)  (57) 

where  jij  and  mj  belong  to  ,  then  it  must  be  true  that  _n,  =  m, 

(mod_P,).  Combining  (56)  and  (57)  yields 


H'  -  '-V*  ;V. 


r^-96 


which  translates  into 


QjtT’n,  *  2i£-1®i  ♦  S.  L 

for  seme  integer  vector  r_.  We  multiply  both  sides  of  the  equality  by 
U  Q?-1,  we  have 

n,  -  m,  ♦  U  Q2~’n  r  (58) 

Due  to  the  fact  that  N_ -  Q?P?,  we  see  that 

",  *  “  *2  l 

Finally,  from  the  relation 

£  Pj  «  P,  V  (59) 

(recall  that  £.1  -  ££,  and  P^  -  V) ,  we  obtain 


which,  since  V_r_  is  an  integer  vector,  is  equivalent  to 
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n,  *  ■,  (»od  Pt)  (60) 

(CO)  proves  eh*  one-to-one  property  of  the  aep  P. 

As  we  have  already  Mentioned'  the  onto  property  is  implied  to 
by  the  one-to-one  property  and  the  finiteness  of  the  sets.  Thus,  P 
is  a  legitimate  isomorphism  map. 

The  output  map  S  is  slightly  more  complicated  but  the  same 
method  is  used  to  shorn  that  it  satisifies  the  necessary  re¬ 
quirements.  tt  is  given  by 

Jr  3  A,  R,  *  t.2  -2  (nod  ■*>  (6»> 

where  A_,  and  A^  are  integer  matrices  satisfying 


a,  i) 2  -i  (■od  £aT> 

Aj  k2  =  i2  (mod  0,T> 


(62) 


and 


a,  -  a,  a, 

^  -  z2\ 


(C3) 


where  Bj  and  Bj  are  some  integer  matrices.  (63),  essentially,  states 
that  Aj  is  a  right  multiple  of  QjT  and  A j  a  right  multiple  of  P,T. 
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Ne  substitute  (55)  and  (81)  into  the  exponent  of  the  MOPT  (52) 

to  jet 


kV’n  -  (A,^*  *2-2,T-"1  <a2-_1-l  ♦  P,u' ’n*1 


which ,  when  expended,  becomes 


T  -1  T  -1  -1 

k  N  n  -  (A,!,)  N  n,  ♦ 

4  2  * T— ” '  2-2— ~ 1  — 1  * 

Ne  examine  now  each  term  in  the  right-hand  side  of  identity  (64). 
Noting  that  N  «  Q?  the  first  term  yields 

*  <*,jL,)T22'1fi2'1fi2£',il1 

and,  since  Qj” ■  1,  the  identity  matrix,  then 

-  (A1Hl)TP2‘V1n1 

Osing  (62),  we  get 

(A1k1)TN~1Q30~,n1  5  k^p^V1^  (mod  Pj)  (85) 


T  -1  -1 

<*!*!>  !i  £|£  H2 

T  -1  -1 

<i2V  £  H2 


1h«  second  tera  of  (64)  is  considered  next.  We  substitute  PjQj  for  » 
to  obtain 

> V1*, if ^  “  <^ilL1)T2i“1£t“1£1Il”,Il2 

hence , 

T  -1  -t  T  -1  -1 

(A  k^)  -  £,11  "2  “  <i,ii  >  S,  £  D-2 

which,  by  the  transpose  property,  becomes 

<£,Ji,)TIi~1£1“"1Il2  *  JL1T£,Tai’1£1Ji2  (66> 

with  the  aid  of  (63) .  (66)  can  be  expressed  as 

'llif  ^  *  jLiT£iTfi.,ai"1jf1i!2 

hence 

>T**  ' P.,0  'rjj  *  t., T£, T£ 

Notice  that  the  right-hand  side  of  identity  (67)  is  an  integer 
number  since  0~*  is  unimodular,  thus 
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and 


e«pl-J2«(A  k^V’PjBjl  -  1 


(68) 


Poliowing  the  san«  process,  we  find  that  the  regaining  terns 
in  (4)  can  be  expressed  as 


(^2-2,T-"1-2-"1-1  E  £  (nod  l) 


(69) 


and 


T  -1  -1  T  -1  -1 

(*L2i2)  £  P,U  n2  -  t2  fi,  u  n2 


(70) 


Next,  we  substitute  (6$),  (68),  (69)  and  (70)  into  (69),  Then 
the  mdft  (52)  can  be  expressed  as 


<(£,#  *.2>  -  l  I 

-1CLL/P,  -2eLI/02 


x(n,,  IS.2*  J2*jL,T£a-1o-1£L1 1 


(71) 


k.  *  L.  * 
i/Pj 


-2  C  Li/2, 


•  expt-j2«k2T21”,£~,n2l 


We  define  new  matrices,  p  and  Q  as  follows 


£-101 
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•  •«pt-j2*k,T2  n  1 

k  C  L  T 
— t  l/P 

±2  * 


Finally,  to  evaluate  (75),  let  us  express  it  in  two  pacts 


C(n,,  k2)  -  l 


n.  £L 
— « t  I/J 


T  -1 

xtn,,  ^  «*Pl-j2»k,2  2.  S.2* 


(76) 


XU,,  k2 


I  C(n_, ,  lt^ )  expf- j2wjt,Tp_,n,, ) 

n  CL  _ 

— 1  £/P 


(77) 


The  summation  in  (76)  represents  a  collection  of  MDFTs  with 
periodicity  matrix  £.  A  different .matrix-Q  DFT  must  be  evaluated  on 
the  array  x(_nj,  n^)  for  each  value  of  the  vector  _nj.  This  means  that 
the  number  of  matrix-Q  DFTs  is  p.  The  summation  indicated  in  (77)  is 
another  collection  of  MDFTs,  on  the  array  C(_n, ,  Jtj) .  For  each  value 
of  the  index  k^.  a  matrix-!*  DFT  is  evaluated.  Therefore,  equation 
(77)  representes  q  aatrix-P.  DFTs.  Thus,  we  have  reduced  the  matrix-!^ 
DPT  into  a  set  of  smaller  lenqth  MDFTs.  By  comparison  with  the 
matrix  Cooley-Tukey  algorithm,  we  notice,  as  in  the  1-D  case,  that 
there  are  no  twiddle  factor  multiplications.  This  technique  can  be 
extended,  in  a  straightforward  manner,  to  Include  more  than  two 
factors. 
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One  important  observation  to  ba  made  is  that  N,  P  and  Q  have 
the  same  unintodular  matrices  in  their  normal  decomposition.  Indeed, 

°2  - 

P  •  n  D,  V  (78) 

fi  ■  CDj  V 

The  importance  stems  from  the  fact  that  0_  and  are  involved  in  input 
and  output  reindexing,  respectively.  Thus,  the  matrix-P_  DFTs  and  the 
matrix-^  DFTs  may  share  <.nose  parts  of  the  alqorithm  that  are  related 
to  JO  and  V. 

If  S  is  a  diagonal  matrix,  then  U_  and  become  identity 
matrices,  and  the  MDFT  reduces  to  a  conventional  rectangular  DPT. 
The  MPFA  procedure  is  significantly  simplified  and  the  result  re¬ 
sembles  the  complex  techniques  developed  by  Nussbaumer  |4]  using 
polynomial  transforms.  Thus,  those  algorithms  can  be  derived  as 
special  cases  of  our  general  algorithm. 

He  deduce,  next,  the  number  of  multiplications,  K,  and 
additions.  A,  which  result  from  computing  the  matrix-N  DPT  by  the 
MPFA.  let  M,  and  A^  (Mj  and  Aj)  be  the  number  of  multiplications  and 
additions,  respectively,  required  to  calculate  the  matrix-^P  DPT 
(matrix-Q  DPT) .  Then,  clearly, 

M  -  q  M,  ♦  pM2  (78) 

h  -  qA,  pA2  (80) 


J2.-/OV 


Note  that  (79)  represent*  an  upper  bound  only,  since  some  of 
the  multiplications  may  be  trivial  operations  (multiplications  by 
unit  elements) . 

In  the  next  section  we  will  give  a  detailed  description  of  a 
simple  example  to  illustrate  all  the  characteristics  of  the  MPFA 
technique. 

It  is  important  to  emphasise  that  the  KPFA  constitutes  only 
one  way,  among  others,  for  implementing  equation  ( 75) .  It  is  a 
row-column  type  of  computation,  where  the  concepts  of  'row*  and 
'column*  are  expanded  to  have  meaning  in  the  case  of  multidimensional 
data.  for  the  multidimensional  data  array  x  ,  n?)  a  column 

consists  of  the  data  taken  by  holding  the  first  data  index  fixed 
and  letting  ^  take  all  its  possible  values..  A  row  is  obtained  by 
fixing  n^  and  varying  n^,.  We  will  describe  in  the  last  section  of 
this  chapter  a  method  for  organizing  the  data  array  into  rows  and 
columns,  where  this  time  the  terms  'row*  and  'column*  do  have  the 
traditional  meaning. 

Thus  equation  (75)  must  not  be  viewed  as  the  end  product  of  an 
algorithm  design  process,  but  rather  as  the  branching  point  for 
possibly  many  FPT  algorithms.  In  addition  to  the  PFA  another  highly 
efficient  algorithm  for  the  evaluation  of  rectangular  DFTs  is  tt.« 
Hi nog rad  Fourier  Transform  algorithm  (WFTA)  [3].  It  is  based  on  a 
nesting  algorithm  proposed  by  Winograd  [10].  In  this  approach,  all 
the  multiplications  steps  of  the  short  DFTs  are  combined  (nested) 
into  one  multiplication  step.  The  WFTA  displays  superior  mul'.iply- 
add  characteristics  which,  often,  make  it  preferable  to  the  FFA. 


£’!0$ 
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Among  its  disadvantages ,  we  cite  its  slightly  higher  number  of 
additions  and  its  higher  memory  requirement.  Johnson  and  Burrus 
(35),  have  recently  shown  that  the  PFA  and  the  WFTA  belong  to  a  much 
larger  class  of  OFT  algorithms.  This  general  class  can  be  generated 
by  systematically  nesting  the  short  OFT  modules  in  all  possible 
manners. 

We  will  conclude  this  section  by  showing  how  nesting  can  be 
applied  to  evaluate  the  MDFT  (75).  The  derivation  given  here, 
parallels  essentially  the  one  given  by  Nussbaumer  (4)  for  rectangular 
DFTs. 


Let  X_ 


(n, )  be  an  indexed  column  vector  with  p  elements. 


X  <n,J  -  X(nt,  n2) 


for  each  jij  *n  ^i/QS  *n^  *et  —  *>•  *  P  bY  P  matrix  of  the  complex  ex- 

T  —1 

ponentials  exp(- j2*Jr^  P_  JJj ]  where  the  rows  are  indexed  according  to 
Jr,  and  the  columns  according  tojn^.  Then,  (75)  can  be  expressed  as 


X(k,,  k^) 


-2tLI/2, 


expl-jZxlrjV1^!  *  *„ 

—2 


(81) 


Equation  (81)  represents  a  matnx-Q_  DFT  where  each  multiplication  by 
exp  (-  j  2*k  -yTQ~  1  Is  replaced  by  a  multiplication  by 

♦xpt-J^sk^Q’^nylA.  But,  due  to  the  particular  form  of  A  the  latter 
is  in  reality  a  matrix-P  DFT  where  each  multiplication  by 
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«xp(~j2»fcjTP~,nj 1  Is  replaced  by  a  multiplication  by 
exp[-j2wk,TQ~1n7l  expl- j2«|tjTPf  V,  l  •  Thus,  since  the  matrix-Q  DFT 
ir'  -  'es  Mj  multiplications,  there  are  Mj  such  matrix-^  OFTs. 
Therefore,  the  total  number  of  multiplications  K  is 

M  -  (82) 

where  M1  is  the  number  of  multiplications  for  a  matrix-^  DFT.  The 
total  number  of  additions,  on  the  other  hand,  is  equal  to 

A  -  MJA1  ♦  p  A2  (83) 

where  Aj  and  A2  are  the  number  of  additions  for  matrix-P_  and  matrix-Q 
DFTs,  respectively. 

It  has  been  shown  (4],  that,  in  comparison  with  the  PFA,  the 
WFTA  reduces  the  number  of  multiplications  by  as  much  cs  a  factor  2 
for  some  lengths,  while  requiring  a  slightly  larger  number  of 
additions.  Since  equations  (82)  and  (83)  are  valid,  whether  the  MDFT 
is  rectangular  or  not,  the  comparison  still  hold  in  the  general  case. 

Hybrid  algorithms  that  combine  the  structures  of  the  PFA  and 
the  WFTA  can  also  be  extended,  in  a  equivalent  manner,  to  the  MDFT 
case. 


Aw  Example  of  the  HFFA  Algorithms:  The  Hexagonal  PFA 
The  MPFA  technique  is,  perhaps,  best  understood  through  the 
examination  of  an  example.  After  the  rectangular  DPT,  the  next  most 
important  class  of  DFTs  is  the  hexagonal  DFT  (19).  A  2-0  DFT  that 
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relate*  «  hexagonally-sasipled  signal  to  hexagonal  samples 
Fourier  transform  has  a  periodicity  matrix 


N 

2N 


] 


The  Smith  Normal  decomposition  of  N  is 


2 

-1 


J 


2 

We  let  p-3  and  q»N‘j  if  (3,N)  *  1,  then  (p,  q)  «  1.  Then  a 
ration  of  N_  as  in  (40)  is  possible,  with 
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In  Figure  10  we  show  the  regions  ^i/Pj  and  Li/gt <  f°r  the  case 
of  M«2.  The  residue  classes  are  represented  here  by  the  integers 
inside  the  parallelograms  formed  from  the  columns  of  the  respective 
matrices.  This  representation  has  been  selected  because  we  are 
familiar  with  it,  although,  perhaps,  it  is  not  the  best  represen¬ 
tation  possible.  In  fact,  */e  will  provide  in  the  nest  chapter  a  much 
better  representation  which  significantly  facilitates  the  indexing 
and  reindexing  steps  of  the  algorithm.  For  now,  we  thus  have 


li/n  “  l(0  0)T'  (1  1)T'  (2'  2)T'  <3  3,T'  (4  4)T'  (S  5,T' 


T  T  T  T  T  Ti 

(2  1)  ,  (3  2),  (4  3)T,  (1  2)  ,  (2  3)\  (3  4)  } 


(87) 


LI/P,  *  *(0  0,T*  <’  *>  '  »2  °>  I 


(88) 


lt/o  "  l  <0  0)T,  (1  0)T,  (2  1)T,  (3,  0T} 


1/52 


(89) 


The  next  step  is  to  compute  the  mapping  represented  by  (55) 
between  the  elements  of  and  the  elements  of  the  direct  sun  tjy 

♦  The  correspondence  between  the  indices  _n^  and  the  indices 

(n_,,  nj)  is  found  to  be: 
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Next,  the  output  napping  (61)  must  be  performed.  Note  that  since 
is  syiunetriCf  Lj^nt  “ 

Once  the  mappings  are  found,  a  partial  flowchart  for  the  al¬ 
gorithm  can  be  drawn,  which  is  done  in  Figure  11.  Each  matrix-Q,  DPT 
operates  on  a  subset  of  the  input  array.  Indexed  by_nj.  The  results 
are  fed  to  the  matrix-Pj  DFTs .  The  matrix-Q,  DVT  is  a  four-point  DFT 
and  the  aatrix-Pj  DFT  is  a  3-point  DFT.  The  flowchart  of  these 
matrix-DPTs  is  given  in  Figure  12. 

This  example  illustrates  fairly  well  the  form  of  an  HPFA  pro¬ 
gram.  One  part  of  the  program  should  be  concerned  with  the  input 
indexing.  Another  part  should  realise  the  output  indexing.  There 
should  also  be  a  list  of  subprograms  each  of  which  correspond  to  a 
short-length  DPT.  Large  computation  savings  can  be  made  if  those 
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short  DPTs  are  optimised  with  respect  to  the  number  of  multipli¬ 
cations.  The  UDV  algorithm  should  be  used  to  compute  those  short 
DPTs.  Rectangular  short  DPTs  are  then  obtained  which  can  be  evaluted 
in  a  simple  way  by  the  row-column  method.  A  sore  complicated 
approach,  but  one  which  results  in  greater  savings,  would  be  to  use 
the  faster  vector-radix  algorithm  or  even  the  very  fast  (N  by  N)  DPTs 
developed  by  Winograd  [33],  for  N  prime.  More  will  be  said  about 
this  in  the  next  chapter. 

Rectangular lsat ion  of  the  Indices i 

Xn  the  evaluation  of  matrix  DPTs,  one  must  keep  track  of  both 
the  time  and  frequency  indices.  An  advantage  of  this  is  the 
elimination  of  the  problem  of  the  ordering  of  the  output  data 
array.  The  address  of  the  output  sample  is  automatically  known  and 
is  given  by  the  frequency  index  array.  However  some  memory  space 
must  be  reserved  for  that  array  and  that  amount  of  space  may  quickly 
become  quite  prohibitive  for  large  MDFTs. 

In  the  rectangular  case,  the  programming  task  is  greatly 
facilitated  by  the  implicit  nature  of  the  indices.  Simple  nested 
*do*  loops  (in  Fortran)  can  generate  all  the  indices  if  the  lengths 


of  the  DPT  in  each 

dimension  are 

known. 

On  the 

other 

hand,  the 

explicit  nature  of 

the  indices 

in  the 

general 

MDFT 

make  the 

programming  task  more 

difficult. 

To  alleviate  these  problems,  a  procedure  will  be  developed 
which  permits  the  representation  of  indices  in  a  rectangular  form 
even  for  nonrectangular  DPTs.  Without  loss  of  generality,  let's 
consider  a  2-D  matrix  N  DPT.  Write  N  in  terns  of  columns 
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«  ’  <£,  £2’ 

Then,  we  apply  theorem  4  in  Chapter  II  which  state3  that  there  are 
integer  vectors  r,  and  Xj  and  integers  g^,  g21,  g22  such  that 


£l  "SL11  “1 

£2  “  3-21  -1  +  3-22  -2 


Moreover  the  set  of  vectors 


ui  £1  *  “2  2a 

"  0,1,...,gir1 
" u2  * 


(93) 


constitute  a  representative  system  for 

Observe  that  if  we  know  _Xj ,  Xj ,  g, .  and  g22  then  we  will  be 
able  to  generate,  quite  simply,  all  the  indices  by  varying  u,  and  u2 
in  (93).  This  process  resembles  the  process  of  generating  the 
indices  in  the  rectangular  case.  For  that  reason  we  call  it  the 
rectangular itation  process. 

The  same  process  may  be  applied  to  the  matrix  to  generate 
the  frequency  indices.  That  is,  we  can  find  integer  vectors  ^ 
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and  integers  1,,  and  l22  from  the  column a  of  nJ  such  that  the  set  of 
vectors 


(94) 


*1  *1  ♦  »2  *2 

v,  -  •»* . 11r1 

Vj  -  0,1,. ..,1^-1 


generate  the  output  indices. 

He  substitute  (93)  and  94)  into  the  MDFT  equation  to  gets 


x(v1# 


V 


i  l  x(u  ,u  )  exp(-j2»(v  V  ) 
u^”0  u2-0 

{ <XlXj)  Tn"  ’  l*,^)  1  (“  1U2)  Tj 


(95) 


Vj  ■  0,1,... ,1^-1 


v2  “  o,i,...,i22-i 


Equation  (95)  illustrates  clearly  the  relationship  between  nonrectan- 
gular  DFTs  and  rectangular  DFTs.  It  shows  that  while  the  rectangular 
DPT  is  a  special  case  of  the  MDFT,  the  MDFT  can  be  considered  to  be  a 
special  case  of  the  rectangular  OFT.  This  seemingly  paradoxal  state¬ 
ment  can  be  understood  by  noting  that  the  different  representations 
are  mainly  convenient  choices  for  organising  the  numbers  that  repre- 
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sent  data.  To  see  it  laora  clearly,  assume  that  g22  -  122  "  then 
the  variable  u2  and  v2  vanish  and  (95)  becones 

x(v,)  -  l  x(Uj)  e*p(-j2s(2r1TM",xt)u1v1)  (96) 

tij»0 

X  -i 

the  tern  N  will  reduce  to  t/ldet  N_l ,  where  t  is  some  integer 
which  satisfy  (t,  Idet  N)  I  ■  1.  Moreover,  gn  ■  ln  ■  Idet  N_| . 
Thus,  we  have 


Idet  M  1-1 

*<V  ■  l  *<»»,)  ««PM  idet  wT  fc  “iV  (97) 

u  »0  — 


(97)  represents  a  1-D  OFT  where  the  output  index  v1  is  perauted  by 
the  integer  t.  Thus  the  MDFT  has  become  a  perauted  1-D  OFT. 

We  conclude  this  section  by  noting  that  by  writing  the  MDFT  as 
in  (95),  we  reintroduce  the  problem  of  non-ordering  of  the  output 
samples.  Indeed,  the  output  resulting  from  (95)  will  not  be  in  the 
same  order  as  the  input,  and  consequently,  an  additional  unscrambling 
step  is  necessary. 

Extensions  to  the  Watria-Cooley-Tufcey  Algor  1 that 
We  have  already  mentioned  in  Chapter  I,  the  Matrix-Cooley- 
Tukey  algorithm  developed  by  Mersereau  and  Speake  [20).  Like  the 
MPFA,  it  combines  short  length  DFTs  to  compute  long  DFTs.  However, 
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it  does  not  require  the  length*  of  the  short  DPTs  to  be  relatively 
prime  to  each  other.  Thus,  a  useful  application  of  the  algorithm  is 
for  MDKTs  which  have  for  their  length  a  power  of  a  prime.  The  MPPA 
has  a  simpler  structure  and  does  not  contain  twiddle  factors.  For 
these  reasons,  the  matrix-Cooley-Tukey  algorithm  is,  in  general,  less 
efficient  than  the  MPFA,  but  it  offers  a  much  better  variety  of 
possible  lengths. 

Recall  that  the  input  indexing  present  in  the  natr ix-Cool:y- 
Tukey  algorithm  is  realized  by  the  mapping 


J!  5  Z.  S.  ♦  R  (mod  N) 


(98) 


where  n_  e  and  £_  and  belong  to  the  sets  Ip  and  Iq,  re¬ 
spectively.  Ip  and  Iq  are  described  in  (21)  in  terms  of  cosets  and 
matrix-sampling  of  sets-.  This  description,  while  valid,  does  not 
allow  for  easy  implementation.  Except  for  some  special  cases,  such 
as  the  rectangular  and  the  hexagonal  cases,  it  is  rather  difficult  to 
construct  the  sets  Ip  and  Iq  in  a  systematic  fashion.  Our  goal  in 
this  section  is  to  use  our  knowledge  gained  from  the  MPFA  to  present, 
hopefully,  better  alternatives. 

Instead  of  Ip  and  Iq  as  defined  above,  let  us  select  LI/P  and 
I*I/q,  i.e.*  let  £_  e  ^X/P  and  3.  8  li/q*  We  must  show  that  the  mapping 
(98)  Kirks,  that  is,  it  must  be  one-to-one. 

Assume  that  n  satisfies  both 
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n  P  a,  ♦  £,  (mod  H) 

(99) 

n  gj  +  £2  (mod  H) 

where  £j  and  £j  belong  to  Lj^p  and  and  q^  to  LI/(q.  Then 

Z  S,  +  Ei  *II2  ♦  E2  +  Z  I. 

for  some  integer  vector  r_»  then 

£,  '  e2  ♦  £.  <a2  -  a,>  +  ii 

(100) 

He  substitute  N  *  P  Q  in  (100)  to  get 

£,  *  £2  ♦  £  %  -  3)  +2L> 

(101) 

(101)  is  equivalent  to 

£j  Pj  (mod  P) 

(102) 

that  is  £j  is  equal  to  Pj  in  We  use  (102)  in  (99)  to  get 

pg.,  » p  a2  ♦  p  a  t. 

i 

(103) 

He  multiply  both  sides  of  (113)  by  P-'*  then 

3i  “  32  ♦  £  £ 

(104) 

thus  and  ^  »r*  equal  in  (101)  and  (104)  together  show  the 
one-to-one  property  of  (98) . 

If  we  look  at  the  examples  given  in  (21),  we  find  that  indeed 
the  sets  selected  satisfy  our  construction.  This  construction  could 
be  implemented  in  a  fairly  systematic  way  by  the  rectangular izat ion 
process  developed  .n  the  previous  section.  Another  implementation  is 
to  use  the  Smith  Normal  decomposition  of  the  matrices  N,  P  and  0. 
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raAPTBR  IV 

new  rrr  implementations 

In  the  present  chapter,  we  attempt  to  cover  the  practical  side 
of  the  thesis.  The  object  of  attention  will  be  a  set  of  computer 
subroutines  that  solve  the  MDFT.  Specifically,  in  the  first  section 
we  apply  the  techniques  developed  in  Chapter  III  to  construct  an  MPFA 
procedure.  A  detailed  discussion  of  the  algorithm  reveals  the 
existence  of  a  computation*1 ly  elaborate  step  in  the  process.  This 
step,  represented  by  equation  (55),  corresponds  to  the  determination 
of  the  input  samples  to  the  short  DFTs.  The  difficulty  resides 
mostly  in  the  (mod  N)  operation  but  it  is  also  the  result  of  the 
various  matrix  multiplications  present  in  (55).  The  (mod  N) 
operation  is  essentially  a  multidimensional  vectorial  operation,  and, 
as  such,  is  highly  dependent  upon  the  selected  set  of  vectors  L.^. 
From  a  detailed  examination  of  the  problem,  we  will  be  able  to  pro¬ 
vide  a  satisfactory  solution  to  it.  The  critical  part  of  the 
solution  will  consist  in  selecting  the  "right*  set  of  residues  Lj^. 

In  section  two  we  carry  out  a  comprehensive  evaluation  of  the 
MPFA  subroutine.  The  algorithm  is  tested  for  a  wide  variety  of 
allowed  lengths  and  also  for  a  variety  of  forms.  It  is  compared  to 
two  other  algorithms.  Since  an  FFT  must  be  fast,  the  principle  com¬ 
parison  of  the  various  algorithms  will  be  according  to  computational 
speed. 
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The  MPFA  is  a  general  purpose  FFT  algorithm.  The  generality 
is  with  respect  to  two  aspects:  first,  it  involves  variable  lengths, 
and  second  it  accommodates  variable  forms,  as  was  already  mentioned 
in  the  previous  chapter.  Because  of  this  complete  dual  generality,  a 
loss  of  efficiency  is  present  due  to  the  necessity  of  tradeoffs.  If 
we  drop  the  requirement  to  include  all  forms,  then  we  can  reasonably 
expect  to  improve  on  the  efficiency  on  the  program.  This  is  indeed 
the  case  with  the  rectanqular  MPFA.  In  section  three  we  will  show 
how  to  design  an  MPFA  with  a  .ion- rectangular  specific  form.  In 
particular,  we  will  describe  a  nixed- length  hexagonal  Prime  Factor 
Algor  ithm  (HPFA) . 

The  modules  in  the  MPFA  are  evaluated  according  to  the  0^  V_ 
technique.  Moreover,  the  resulting  matrix-D_  DFT  is  computed  with  the 
simple  row-column  method.  As  mentioned  before,  an  alternative 
approach  is  to  use  a  nesting  method.  If,  for  .instance,  winograd* s 
nesting  is  used  then  we  would  have,  to  stay  consistent  with  our  no¬ 
tation,  a  Matrix  Winograd  Fourier  Transform  Algorithm  (mwfta) . 
Nesting  improves  the  computational  load  of  the  algorithm,  but  it  also 
complicates  its  control  structure. 

Neither  the  row-column  nor  the  nesting  approaches  are  the 
fastest  known  algorithms  for  evaluating  matrix-I)  DFTs.  For  instance, 
Ausslander,  Feiq  and  Winograd  have  recently  proposed  a  very  efficient 
method  for  evaluating  rectangular  p  by  p  DFTs  (where  p  is  a  prime 
number).  In  section  four  we  describe  an  algorithm  which  will  allow 
us  to  include  new  algorithms  as  they  become  available.  The  approach 
takes  advantage  of  the  modular  nature  of  the  MDFT.  A  related  problem 
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is  that  of  selecting  the  most  appropriate  matt  ix-N_  which  can  be  used 
to  compute  the  OFT  of  a  given  finite  extent  multidimensional  se¬ 
quence.  The  answer  is  dependent  upon  not  only  the  shape  of  the 
sequence  but  also  upon  the  FFT  algorithm  at  hand.  We  will  validate 
the  answer  by  considering  a  simple  example  and  using  the  MPFA. 

As  we  have  already  mentioned  in  the  introduction,  we  are 
voluntarily  limiting  ourcelves  to  the  two-dimensional  case  for  the 
purpose  of  clarity  and  ease  of  presentation.  Nevertheless,  every 
concept  and  technique  that  will  presented  applies  equally  well  to  the 
case  of  more  than  two  dimensions.  For  instance,  starting  from  a 
two-dimensional  MPFA  algorithm,  it  is  straightforward  to  construct  a 
three-dimensional  MPFA  algorithm. 

The  MPFA  Algorithm 

In  this  section  we  propose  a  Fortran  5  implementation  of  the 
MPFA  algorithm  discussed  in  the  previous  chapter.  A  global  flowchart 
of  the  program  is  shown  in  Figure  13.  The  input  to  the  subroutine 
consists  of  the  data  input  array  organized  in  a  ‘satisfactory* 
manner.  We  will  see  shortly  that  this  data  organisation  is  of  utmost 
importance  to  the  reindexing  problem.  Next,  the  factors,  correspond¬ 
ing  to  the  prime  factorization  of  the  periodicity  matrix,  are  ordered 
according  to  length  and  form.  Corresponding  to  each  factor  in  turn, 
a  set  of  Indices  required  for  each  short-length  DPT  is  calculated 
using  the  input  mapping  equation  (55).  Then,  the  data  given  by  these 
indices  is  transformed  by  a  DFT  module  of  proper  length  and  proper 
form.  The  results  of  this  transform  are  replaced  in  the  data 
vector.  If  more  of  the  same  factors  are  required,  then  the  new 
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Figure  13.  Flowchart  of  the  MPFA  Algorithm 


indices  are  calculated.  Otherwise,  the  program  moves  on  to  the  next 
factor.  The  program  terminates  when  all  the  factors  have  been  used. 

Observe  that  the  flowchart  doesn't  include  the  output  reindex¬ 
ing  step  corresponding  to  the  output  map  (61).  One  reason  is  that 
solving  (61)  -  (63)  for  the  matrices  V  and  B_'  is  not  a  trivial 
task.  The  main  reason,  however,  is  that  the  map  (61)  produces  an 
incorrectly  ordered  output.  Consequently,  the  output  sequence  must 
be  rearranged  to  get  the  DfT  in  proper  order.  This  is  analogous  to 
the  reversed-bit  nature  of  the  output  of  the  one-dimensional  radix-2 
FFT. 

To  better  explain  the  concept  of  in-order  calculation,  assume 
the  complex  input  data  is  given  in  arrays  X  and  Y  (real  and  imaginary 
parts,  respectively).  The  program  calculates  the  matrix-N^  DFT  in 
place,  i.e.  the  output  is  written  over  the  input  in  X  and  Y.  The 
in-place  requirement  is  necessary  when  memory  space  is  limited. 
After  the  in-place  calculations  are  made,  the  locations  of  the  output 
MDFT  values  are  given  by  the  input  map  (55)  but  the  frequency  index 
is  given  by  the  output  map  (61).  Thus,  it  is  necessary  to  permute 
the  output  if  we  wish  to  have  it  in  proper  order.  One  benefit  of 
having  a  correctly  ordered  output  is  that  it  is  no  longer  necessary 
to  keep  in  memory  the  addresses  of  the  output  since  they  are  identi¬ 
cal  to  that  of  the  input. 

In  the  one-dimensional  case,  Burrus  and  Eschenbacher  (36)  have 
recently  proposed  a  new  implementation  of  the  PFA  which  allows  the 
transform  to  be  performed  in-place  and  in-order  without  output  un¬ 
scrambling.  The  solution  consists  of  using  identical  maps  for  the 
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Input  and  output  indexing.  However ,  their  approach  is  applicable 
only  to  fixed  size  transforms  since  the  structure  of  each  short  DPT 
becomes  dependent  on  the  size  of  the  full  transform.  Thus,  the  pro¬ 
gram  must  reorder  the  output  of  each  module  and  the  reordering  varies 
according  to  the  transform  size.  Rothweiller  [37)  solved  this 
problem  by  using  distinct  pointers  for  the  input  and  output  data  for 
each  DPT  module.  Our  goal  is  to  extend  these  techniques  to  the  MDFT 
case. 

Consider  again,  the  two- factor  matrix-N_  DPT  of  Chapter  111: 


X(k) 


*eLl/N 


x(r)exp[-j2»  k_T(J  'nj 


(105) 


This  time  we  use  the  following  input  and  output  maps: 

i  ’  82  £  'lli  *  0  1  n.2  (mod  N) 

T  -T  T  — T  T 

k  -  a,  V  k ,  ♦  p‘  V  k2  (mod  N  ) 

If  we  develop  the  various  matrices  into  their  Smith  normal  forms,  we 
get 


n  -  0  Oj  0_1n,  ♦  0  D,  U_1n2  (nod  N>  (106) 


*  -  VT£2  (v” ’ ) *k  (  ♦  VTD,  (V_1  )Tkj  (nod  NT)  (107) 


Observe  that  (106)  and  (107)  have  the  same  fora,  except  for  the  input 
and  output  natrices  U_  and  V.  Next,  we  substitute  (106)  and  (107) 
into  (10S),  and  after  a  number  of  simplif ications  we  obtain 


x  {*Lt  '-2 *  "  i  I  *  (£.,  <2.2 >  e*P I- i 2*  (Dj V  Tk f ) TD,  ’ 0  1  n?  1 


Hi  h2 


♦  expt-  j2*(D,if\2)TI>"V1n2) 


-1  *  LI/0D}'  -2  6  LI/UD2 
i(  *  1  T  '  2.2  C  1  T 

m,  i/i  e2 


Let’s  look  more  closely  at  the  inner  summation:  it  is  a  set  of 


aatrix-Dj  DFTs  of  the  fora 
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c*2.1  .ij’ 


I  *<!»,  'ILj*  •*PI-J2«(B,v"Tk2)  TD^1o“1£5al 
—2 


As  In  section  tto  of  Chapter  XIX,  we  let 


—2  '  — _1— 2 
®  -x 

k  m  V  k 
-2  -  -2 


ril1  n_  c  L  ^  $  k .  c  L.  « 

Then,  —2  —2  and 


Cin^ltj)  *  l  *(ill/i»2)e*Pf-j2w(D1k2)TD21n2)  (109) 

"2 

Since  and  Dj  are  relatively  prime  matrices,  the  operation 

+  * 

(D,k2)  mod  Dj  is  simply  a  permutation  of  the  vector  k,.  Thus,  we  let 


*2  “  <ELiJL2>  <mod  £2> 


(110) 


Then, 


ufm 
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c*2.t'ia)  “  l  x<Ii.|«-2,**pI"*2*  £2T2.2,H.21 


-2  “1/5.2 


k„eL,  ^ 

~2  b 52 


oil) 


It  is  seen  that  the  short  DPT  (lit)  is  a  conventional  rectangular 
watrix-Oy  DPT.  This  module  normally  produces  outputs  ordered  accord¬ 
ing  to  90  these  outputs  must  be  reordered  according  to  Jc^  Cor  the 
in-order-transform.  This  reordering  uses  Dj  and  thus  the  matrix-!^ 
module  is  not  independent  of  the  other  factors.  As  mentioned  above, 
Rothweiller  suggested  the  use  of  distinct  pointers  for  the  input  and 
output  data  of  each  DPT  module.  In  our  program,  array  JR  contains 
the  addresses  for  the  output  data  in  the  permuted  order  required  by 
(110).  Arrays  NUM1  and  NUM2  are  used  to  reorder  the  output  data 
addresses  (N13M1  corresponds  to  the  first  dimension  and  NUM2  to  the 
second  dimension) .  The  DO  215  and  DO  216  loops  compute  the  proper 
permutaticn  maps  for  each  factor  of  the  transform.  The  Fortran  MOD 
function  is  used  to  realize  equation  (110).  The  short  DFT  modules 
are  thus  independent  of  each  other  and  use  array  IN  is  the  input  data 
pointer  and  JR  as  the  output  data  pointer. 

The  sequence  of  inputs  to  the  short  DFTs  is  calculated  from 
the  index  map  (106).  In  the  one-dimensional  case  (106)  reduces  to 

n  -  DjOj  ♦  Dfn2  (mod  N)  (112) 


J.-I31 


Ill 


where  N  "  Df02  and  n,«0, 1 , . . .  ,D,-1 ,  n2-0, 1, . .. ,Dj-l .  The  (nod  N) 
operation  can  be  inplenented  very  ainply  and  efficiently  in  the  fol¬ 
lowing  way  (36). 


1N{1)  -  1 

00  20  n2  -  1,  D2 

DO  30  n(  •  2,  D, 

IHln,)  -  IN(nj-l)  ♦  D2 

30  IFUNln,)  .GT.N)  IN(n^)  -  IMCn^  -  N 

20  IN  (1)  -  IN  ( 1 )  +  D1 


It  ia  seen  that  (112)  is  inplenented  with  two  additions,  one  condi¬ 
tioned  subs tract  ion  and  one  nonpar ison  statement.  The  (mod  N) 
operations  contains  one  conditioned  substraction  and  one  comparison 
instruction  (IF  statement).  The  multidimensional  (mod  N)  operation, 
unfortunately,  doesn't  lend  itself  to  such  a  relatively  simple  imple¬ 
mentation.  To  better  grasp  the  nature  of  this  problem,  let  us 
consider  an  example.  Assume  that  the  addresses  of  the  input  data  are 
given  by  the  vectors  contained  inside  the  parallelopiped  formed  from 
the  columns  of  N,  as  in  Figure  10.  In  this  example 
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Me  find  that  the  Smith  normal  factorisation  of  N  ia 


a -l?  Jl 

a,  - 1;  ;j 


„  I  2  0  , 

^2  ’  lo  2 ' 


V  -  I1  2| 
~  l0  -1  J 


Thus,  we  have 

^/D,  “  t<0  0,T'  <0  ,,T'  ,0  2,T1 

LX/T32  “  t(0  0,T'  {0  ,,T'  (1  0,T'  11  ,>TJ 


Hext  assume  that  £"1nJ  *  (0  1)T  and  jT'j’a  *  DT.  Equation  (106) 

yields 


n  •  (5  2)T  (mod  N) 


'1 13) 


Proa  Figure  10  we  then  deduce  that  the  solution  to  (113)  is 


c?-/3/ 


This  deduction  is  based  on  the  fact  (5  2)T  and  (3  4>T  occupy 

identical  relative  positions  inside  their  respective  parallelo- 
pipedes.  Numerically,  this  operation  translates  into 

(5  2)T  ■  (3  4)T  ♦  N(1  -1)? 

or  equivalently 

0  -1)T  -  N_,n5  2)T  -  (3  4)TJ  (114) 

In  general  terras,  equation  (114)  maybe  coded  in  the  following  way 

For  each  m  c 

compute  M  1  (n-m) 

IF  results  is  an  integer  than  (mod  N) 

otherwise,  loop 

It  is  clear  that  this  procedure  is  highly  time  consuming.  It 
contains  a  vertorial  substraction,  a  matrix  multiplication  and  a 


vectorial  comparison.  It  My  be  improved  by  transforming  the  multi¬ 
plication  step  into  a  sequence  of  additions  since  all  variables  are 
integers.  This  technique  was  used  and  then  abandonned  after  observ¬ 
ing  the  execution  times  of  the  program.  It  was  found  that 
approximately  75  percent  of  the  total  time  was  spent  on  indexing. 
Comparatively,  only  11  percent  of  time  is  spent  on  indexing  in  the 
one-dimensional  PFA  written  by  Burrus  (36).  We  observed  also,  that 
the  time  varied  significantly  with  the  selected  address  space 
Hence,  a  part  of  the  solution  would  consist  in  selecting  an 
appropriate  address  space. 

In  the  following,  we  proviae  a  method  that  solves  the  multi¬ 
dimensional  indexing  problem  represented  by  (106). 

Indexing  Problem 

To  solve  the  geheral  two-dimensional  indexing  problem,  we 
assume  is  given  in  Smith  normal  form  N  ■  I)  \(  with 


D  -  [P  °] 
—  lo 


and  q  ■  tp  for  some  integer  t. 

The  indexing  problem  nay  be  enunciated  as  follows:  given  a 


vector  m  find  a  vector  n  in  L. 


such  that 
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or,  equivalently,  such  that 


Ji  *  2.  *  2.  1  (115) 

for  some  integer  vector  t_  *  (r,  r2)T.  Since  Lj^,  ia  arbitrary,  let 
us  use  a  rectangular  shape  for  It.  The  dimensions  of  the  rectangle 
are  denoted  0  and  V.  Then,  clearly,  we  must  have 

0  V  -  p  q  (116) 

since  pq  is  the  number  of  data  samples. 

As  we  have  seen  in  the  example  above,  the  coordinates  of  m. 
cannot  be  acted  upon  Independently  of  each  other.  In  fact,  it  is 
only  when  N_  is  diagonal  that  (115)  can  be  written  as  two  uncoupled 
one-dimensional  equations.  But,  let  us  stipulate  at  the  outset  that 
we  can  operate  on  the  second  coordinate  of  *  independently  of  the 
first  coordinate.  Consequently,  if  the  stipulation  is  correct, 

n  I  m  ♦  (0  -V) T  (mod  N) 

That  is,  the  (mod  N)  operation  reduces  to  the  (mod  V)  operation  for 
the  second  coordinate  of  m.  Then  the  problem  changes  to:  find 
(r,  r2)  such  that 


-<r1  r2,T  “  <0  "V>T 


(117) 


V...Y';--’.*.  -\.:~ , ■:.  yyyyyy  •  * 
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The  next  atep  ia  to  reduce  the  first  coordinate  of  _«  by  the  (nod  u) 
operation.  But,  this  operation  will  have,  in  general,  an  effect  on 
the  second  coordinate.  This  effect  ia  represented  by  the  integer  W 
and  we  have 


!*<!,  t2»T  -  (-0  W)T  (118) 

where  (t,  t2)T  is  some  integer  vector. 

Thus,  the  solution  is  obtained  by  solving  for  U,V  and  W  in 
(116)  -  (118).  Equations  (117)  and  (118)  are  linear  Diophantine 

equations.  The  solution  of  such  equations  represents  another  appli¬ 
cation  of  the  Smith  normal  form.  Wc  have 

U  D  V  (r,  r2)T  -  (0  -V)T 

0  D  V  (tt  t2)T  -  (-0  W,T 

We  multiply  both  equations  by  1  to  obtain 

V  (r,  r2)T  -  D_  V’  (0  -V)T  (119) 

V  (t,  t2)T  -  £  V  (-0  W)T  (120) 


we  let 


V  (r,  r2)T  -  (l,  VT 


119 


must  b«  an  integer  vector.  Thus,  we  may  set 


0  “  P  <»22.  t  o|2> 


and  then,  solving  for  V  and  H  we  obtain 


(123) 


V 


(0 


22' 


U12> 


(124) 


W  »  a  p  021  (nod  V) 


(125) 


In  (123)  -  (125)  we  assume  that  (U22»  t  0J2)  is  strictly  positive. 
If  it  is  strictly  negative,  we  can  replace  it  by  its  absolute 
value.  He  shall  discuss  the  case  of  (U22>  t  0,2)  equal  to  zero 
shortly  after  we  summarise  what  we  have  done  so  far. 

To  solve  the  indexing  problem  (115),  we  organize  the  data  as  a 
0  by  V  rectangular  array.  Then  the  first  coordinate  of  m_  may  be 
modified  in  a  (mod  U)  fashion.  At  the  same  time,  the  second  coordi¬ 
nate  is  modified  accordingly,  i.e.  w  is  added  (subtracted)  to  it  each 
time  -0  is  added  (subtracted)  to  the  first  coordinate.  The  second 
step  consists  of  reducing  the  second  coordinate  of  *.  in  the  (mod  V) 
fashion.  This  step  has  no  effect  on  the  first  coordinate. 

If  (022,  t  Ojj)  *«ro,  we  cannot  use  this  procedure  for  we 
cannot  evaluate  (124).  The  alternative  is  to  reverse  the  roles  of 
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the  first  and  second  coordinate.  We  still  organise  the  data  as  a  0 
by  V  rectangular  array,  but  it  is  the  first  coordinate  which  is  re¬ 
duced  independently  of  the  other  coordinate.  Equation  (117)  and 
(118)  are  changed  to 


H  (r,  r2)T  -  (-V  0)T 
N  (t}  t2)T  -  (W  -U)T 


(126) 


Using  the  sane  analysis  as  above,  we  obtain 


P  (0 


21' 


011» 


(127) 


V 


(0 


21' 


t  a 


ii 


w  -  a  p  UJ2  (nod  V) 


where  a  and  b  satisfy 


.U2,  *bt  U„  -  (U21,  to,,) 


£-/39 


Again,  In  (127)  we  assuae  that  (U 


2t>  t  0^)  la  a  nonsero. 

As  an  illustration,  let  ua  consider  the  example  given  at  the 
beginning  of  this  section.  He  have  p  -  2  and  q  ■  6.  Thus  t  •  q/p  » 
3.  Also  since 


we  get  <U22'  1  ai2*  "  (“2*  ■  1,  *nd 

-2a  ♦  3b  «  1 

is  solved  with  a  -  1  and  b  »  1.  He  substitute  these  values  in  (123), 
(124),  (125)  to  get 


0-2 
V  -  6 
H  -  2 


Let  us  now  solve  (113)  again,  i.e.  solve 

n  -  (5  2>T)  (mod  N) 

Subtract  0  to  the  first  coordinate  and,  at  the  same  time,  add  H  to 
the  second  coordinate,  to  get 
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n  i  (3  <)T  (mod  N) 

which  Is  the  answer  we  obtained  previously.  Apply  the  operation  a 
second  time,  then 


n  s  (1  6)T  (nod  N) 

The  last  step  consists  of  reducing  the  second  coordinate  (mod  V)  and, 
therefore 


n  «  (1  0}T 

which  is  the  final  answer  since  (1  0)T  is  inside  the  2  by  6  address 
space. 

The  coding  of  this  procedure  contains  six  additions  and  four 
IP  statements.  The  address  of  the  data  is  contained  in  the  airty 
IR.  IR  (1,1)  represents  the  first  coordinate  of  _n  and  ir  (1,2)  the 
second  coordinate.  The  numbers  U,  V  and  W  are  precalculated  and  fed 
to  the  subroutine  as  variables  NS1G1,  NSXG2  and  NSIG3. 

Since  the  vector  ji_  in  (106)  can  take  both  positive  and  nega¬ 
tive  values,  the  procedure  should  handle  both  cases.  Me  will  see  in 
section  three  that  the  number  of  operations  can  be  cut  in  half  for 
the  hexagonal  caae  because _n  is  known  not  to  take  negative  values. 

Evaluation  of  the  MPPA  and  the  POM  Algorithm 

There  are  a  variety  of  cost  measures  used  to  evaluate  the 
performance  of  algorithms.  Two  of  the  most  common  measures  of  OFT 
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the  progress  were  run  on  a  Data  General  Eclipse  MV/10000  at 
the  Digital  Signal  Processing  laboratory  of  the  School  of  Electrical 
Engineering.  The  MV/10000  is  a  32-bit  general-purpose  processing 
system  with  two  megabytes  of  memory  and  floating-point  hardware.  The 
floating-point  hardware  has  an  add  time  of  .(  us  and  a  multiply  time 
of  about  1  us.  For  each  FFT  a  main  program  was  written  that  provides 
random  numbers  as  input.  The  random  numbers  are  taken  from  a  uniform 
distribution  of  sero  mean  and  standard  deviation  of  one.  Table  2 
shows  the  execution  times  in  milliseconds  for  the  three  algorithms 
for  different  sequence  lengths.  The  DFTs  have  rectangular  form  and 
thus  0_  and  V_  are  identity  matrices.  Because  of  the  high  number  of 
allowed  lengths,  not  all  of  them  could  be  tested.  In  table  3,  we 
compare  the  t£D_  V_  algorithm  and  the  MPFA  for  various  lengths  and  with 


In  table  4,  we  select  instead 


0 


-5 

17 


1 


The  second  0_  matrix  was  chosen  to  contain  large  integer  elements  in 
order  to  observe  the  influence  of  those  elements  on  the  execution 
times. 


z-m 
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The  tlMi  obtained  arc  central  processor  tines,  which  are 
measured  with  one  Millisecond  resolution.  The  tines  Measured  on 
successive  runs  rarely  differ  by  sore  than  four  milliseconds  and  it 
is  the  smallest  of  these  Measures  which  is  recorded. 

A  comparison  of  the  UDV  algorithm  and  Singleton's  algorithm  in 
table  2  shows  that  approximately  13  percent  of  the  time  is  used  in 
Indexing  and  87  percent  in  actual  DPT  calculations  and  unscrambling 
(the  unscrambling  contained  in  Singleton's  algorithm). 

Table  3  indicates  that  selecting  a  non-identity  0_  matrix  has 
only  a  small  effect  on  the  speed  of  the  FPTs.  Only  about  2  percent 
of  time  is.  due  to  the  fact  that  is  not  an  identity  matrix. 
Moreover,  in  table  4  we  observe  that  even  when  U_  contains  large 
Integers,  its  effect  on  the  execution  time  is  fairly  constant. 
Tables  3  and  4  indicate  clearly  that  the  MPFA  is  Lister  than  the 
JDD_V_  algorithm.  The 'times  for  the  U_I>1£  algorithm  are  rather 
erratic  as  a  function  of  sequence  length.  This  results  from  the 
program  having  fairly  efficient  sections  for  power  of  two  factors, 
but  less  efficient  sections  for  odd  factors.  The  program  is  also 
slowed  by  the  twiddle  factors  that  are  necessary  in  the  index  mapping 
used. 

The  results  show  that  a  Mixed  form  PFT  can  be  constructed  from 
a  rectangular  PFT  with  only  a  smill  increase  in  execution  tiae.  In 
addition,  we  see  that  the  mpfa  is  a  very  good  choice  for  general 
purpose  FFT.  in  fact,  we  observe  froM  the  table  that  the  MPFA  is 
faster  than  even  the  rectangular  Singleton's  algor ithm. 
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table  1  IRA*  Operations  Count 


WFTA  Length 

Mutliplies 

Adds 

2 

0 

2 

3 

2 

6 

4 

0 

8 

5 

5 

17 

7 

8 

36 

a 

2 

26 

9 

10 

49 

16 

10 

74 
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Table  1  rim •  la  Mlliiteond*  for  Kectargular  ana 


det  ay 

Factorisation 

Singleton 

UDV 

MFFA 

63 

lx(7*9) 

6 

8 

6 

120 

1*(3*5*8) 

12 

15 

11 

180 

.*(4*5*9) 

17 

20 

14 

180 

(3*4) *(3*5) 

16 

18 

11 

252 

1*(4*7*9) 

29 

33 

21 

252 

(2*7) *(2*9) 

29 

33 

20 

315 

1* ( 5*7*9) 

39 

44 

29 

400 

(4*5) *(4*5) 

36 

41 

25 

630 

1*(2*5*7*9) 

84 

95 

62 

1080 

(2*3*5) *(4*9) 

125 

140 

95 

1260 

1*(4*5*7*9) 

179 

199 

130 

1260 

(2*3*5*7) *(2*3) 

170 

189 

115 

Z-Hb 


Prime  Factor  Alqoritha 


In  aany  applications,  the  fora  of  the  periodicity  aatrix  is 
fixed.  For  instance,  for  rectangular  OFTs  the  and  aatrices  are 
equal  to  the  Identity  aatrix.  Often,  in  areas  such  ar  geophysics  and 
antenna  array  design,  the  signals  are  saapled  hexagonally.  The 
periodicity  aatrix  has  then  the  fora 


r  H  i 

lN  2N  ‘ 


where  H  is  an  integer  such  that  3N^  is  the  total  number  of  data 
samples.  Its  Smith  normal  decomposition  is 


,2  1]  [H  0  ]  [1  2  . 
0 J  l0  3NJ  l0  -1  1 


In  such  cases  where  the  fora  of  is  fixed,  it  is  clear  that 
it  is  advantageous  not  to  use  a  general-purpose  FFT  algor  itha.  For 
instance,  a  rectangular  PFA  is  clearly  faster  than  the  NPFA  because 
the  indexing  problea  is  significantly  reduced.  In  this  section,  we 
will  show  how  to  design  a  fixed  form  P?A  algoritha.  More  specifi¬ 
cally,  the  hexagonal  fora  is  selected  to  denonstrate  the  aethod. 

From  (128),  we  obtain 


.-1  rO  1 


r  1  1 

M  -2 J 


—a  —-a..— a.—*  — 
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Ihen  (123)  -  (12S)  yield 


0  -  N 

V  -  3N  (130) 

W  -  N 


therefore,  the  data  is  best  organized  as  a  3N  by  N  rectangular 
array.  In  addition,  because  of  the  special  form  of  JJ,  it  can  be 
shown  that  _n_,  given  by  the  indexing  equation  (106),  has  positive 
coordinates  for  every  N,  This  observation  makes  the  indexing 
algorithm  a  little  simpler  than  in  the  general  case.  The  hexagonal 
indexing  scheme  can  be  formulated  as  follows: 

1)  if  S  <  n2  <  2N,  subtract  N  to  the  second  coordinate  of  ^ 
and  add  n  to  the  first  coordinate 

2)  if  2n  <  n2,  subtract  2N  to  second  coordinate  and  add  2N  to 
first  coordinate 

3)  if  3H  <  N,  subtract  3N  to  first  coordinate. 

This  scheme  was  used  to  program  a  hexagonal  PFA  algorithm. 

An  alternate  factorization  of  N  is  given  by 


l  2  ~  [ 


2 

1 


(131) 


The  first  observation  we  make  on  (131)  is  that  _P  and  £  commute.  Thus 


the  MPFA  algorithm  will  result  in  matrix-*^  DFTs  and  matrix-Q  DFTs. 
The  second  observation  is  that  £  is  a  rectangular  matrix,  which  sim¬ 
plifies  further  the  Indexing  problem.  In  addition,  the  matr*x-Q_  OFT 
is  always  a  N  by  N  DPT  and  thus,  if  N  is  factored  into  prime  numbers, 
it  can  be  evaluated  very  effic  antly  by  the  newly  presented  p  by  p 
FFT  algorithms.  The  matrix-P.  DPT  is  a  non-rectangular  3-point  OFT 
with  a  fairly  trivial  indexing  problem.  In  our  implementation,  the  N 
by  N  short  module  is  evaluted  by  the  row-column  method. 

In  .  -ble  5  we  list  the  execution  times  of  the  MPFA  and  the 
hexagonal  PPA  (HFFA)  for  various  lengths.  We  find  that  the  RPFA  is 
approximately  10  percent  faster  than  the  MPFA.  It  is  important  to 
note  again  that  the  savings  obtained  in  table  S  are  the  result  of  the 
knowledge  we  have  of  the  1J  matrix,  which  allowed  us  to  reduce  the 
complexity  of  the  indexing  problem. 


ptiaal  Periodicity  Matrix 


It  is  clear  from  our  discussion  that  there  are  various  ways 
for  constructing  an  algorithm  using  the  PFA  technique.  In  the 
algorithms  described  above,  the  modules  were  one-dimensional  WFTAs. 
Our  discussion  of  the  classification  of  MDFTs  suggests  a  more  general 
approach.  It  consists  in  using  two  levels  of  modularity!  the  first 
level  separates  the  short  matrix  DFTs  according  to  lengths,  the 
second  level  evaluates  the  short  DFTs  in  an  appropriate  manner. 
Renee,  the  second  level  for  each  length  n,  is  composed  of  a  finite 
set  of  submodules  corresponding  to  the  finite  number  of  nonequivalent 
length  n  DFTs.  This  bilevel  structure  allows  us  to  include  new 


2-I6D 


algorithms,  as  they  become  available,  for  evaluting  the  submodules 
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for  instance ,  Au slander  _et  _al  have  recently  proposed  an  efficient 
method  for  evaluting  p  by  p  DPT  a  when  p  is  a  prime.  In  the 
algorithms  we  designed  all  the  submodules  are  evaluated  in  the 
row- column  manner.  Hybrid  algorithms,  where  some  of  the  submodules 
are  evaluated  using  the  row-column  technique  and  the  others  are  eva¬ 
luated  in  more  efficient  ways,  are  possible.  Some  tradeoffs  have  to 
be  exercised  between  execution  speed  and  program  length. 

Ne  now  proceed  to  answer  the  questions  we  asked  in  Chapter 
II:  given  a  finite  extend  signal  x,  what  is  the  best  periodicity 
matrix  that  can  be  used  to  compute  the  DPT  of  x?  It  is  clear  that 
the  answer  depends  on  the  shape  of  the  region  of  support,  and  on  the 
algorithm  for  evaluating  DFTs  at  hand.  The  problem  in  its  most 
general  form  is  still  open,  nevertheless  some  answers  can  be  given  if 
the  shape  of  the  region  of  support  is  rectangular.  For  instance, 
assume  we  have  a  4  by  4  rectangular  array  in  the  first  quadrant  of 
the  time  domain.  Moreover,  assume  we  have  at  our  disposition  the 
MPFA  algorithm  we  designed.  The  first  periodicity  matrix  that  can  be 
used  is 


H-l04  !l 


which  is  factored  as 


,4  0.  .1  0. 

*0  1'  l0  4' 
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the  MPFA  evaluates  this  DPT  as  eight  length- 4  WFTAs  (four  rows  and 
four  columns).  The  execution  time  is  found  to  be  2.5  ma. 

An  alternate  periodicity  matrix  can  be 


which  is  factored  as 


N 


1} 


f2  ’j  f2  °]  f1  °l  I1  3I 

l1  0J  l0  1 J  l0  8J  l0  -1 J 


This  DPT  is  evaluated  as  two  length-8  WFTAs  and  eight  length-2 
WFTAs .  The  execution  time  is  2.0  ms. 

Finally.  N  can  be. selected  to  be 


■  -U 

and  the  Smith  normal  factorization  is 

a  -  H  li  i!J  l l  -fl 

The  DPT  is  then  evaluated  as  one  length- 16  WFTA  in  1.4  ms. 
Therefore,  the  third  selection  of  N  is  to  be  preferred.  This  result 
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can  ba  explained  by  the  fact  that  the  16-point  WFTA  ia  note  efficient 
than  the  corresponding  number  of  length-8  WFTAa  and  length-4  WFTAs. 
A  general  conclusion  may  be  inferred  which  is  that  for  a  rectangu¬ 
larly  shaped  signal  select  the  periodicity  matrix  that  leads  to  a  DPT 
which  uses  the  most  efficient  submodules  of  the  FFT  algorithm. 
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CSAPTBR  V 

CONCLUSIONS  AND  RBCOMfBNDATIONS 


Conclusions 

The  objective  of  this  thesis  was  to  develop  efficient 
algorithms  for  evaluating  general  multidimensional  DFTs.  The  first 
issue  which  was  addressed  concerned  the  formulation  of  a  mathematical 
context  in  which  better  understanding  of  the  inner  workings  of  the 
DFT  is  possible.  More  precisely,  the  multidimensional  indices, 
present  in  the  DFT,  were  considered  to  belong  to  a  lattice 
structure.  The  computational  structure  of  the  DFT  was  then  shown  to 
be  closely  related  to  the  special  properties  of  this  mathematical 
structure. 

The  second  issue  was  concerned  with  the  extension  of  the 
important  Chinese  Remainder  Theorem  which  plays  an  important  role  in 
the  effective  handling  of  the  DFT  indices.  We  formulated  and  gave  a 
new  class  of  FFT  algorithms  which  provide  the  same  interesting  add- 
mult  iply-control  tradeoffs  available  in  the  PPA,  wfta  and  split 
nested  DFT.  We  showed  that  these  algorithms  can  also  be  used, 
advantageously,  to  evaluate  rectangular  DFTs. 

The  relationship  between  nonrectangular  DFTs  and  rectangular 
ones  was  shown  to  be  linked  to  the  Smith  Normal  factorization  of  the 
periodicity  matrix.  In  fact,  this  factorisation  allowed  us  to 
develop  an  algorithm  (the  0_  D_ algorithm  which  reduces  a  general 
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DFT  into  the  combination  of  a  rectangular  OPT  and  a  set  of  input  and 
output  Indexings. 

Finally,  practical  algorithms  were  constructed  that  implement 
both  the  0,0.  V'  and  the  MPPA  algorithms.  The  success  of  these 
algorithms  was  found  to  be  contingent  on  an  efficient  implementation 
of  the  input  and  output  indexings.  We  derived  a  method  which  solved 
these  indexings  in  a  successful  way.  In  addition,  we  implemented  an 
efficient  hexagonal  PFA  as  an  illustration  for  constructing  PPT 
algorithms  with  a  given  specific  fora. 

Be  rn—  endat  ions 

A  qeneral  approach  was  used  to  analyse  the  multidimensional 
OFT.  The  objective  of  the  thesis  was  to  concentrate  on  the  extension 
of  the  PFA  and  WFTA  algorithms.  However,  further  work  can  be  pursued 
to  apply  the  procedure  developed  in  this  thesis.  First,  the  lattice 
structure  can  be  used  to  consider  not  only  the  number  of  multipli¬ 
cations  in  a  DFT,  but  also  the  number  of  additions,  the  number  of 
data  memory  accesses,  the  amount  of  indexing  work,  etc. 

Another  topic  for  future  research  concerns  the  development  of 
a  method  for  finding  optimal  periodicity  matrices.  In  our  thesis  we 
discussed  the  case  where  the  finite  extent  signal  has  a  rectangular 
form.  It  if  interesting  to  ask  how  to  choose  the  periodicity  matrix 
when  the  signal  does  not  have  a  rectangular  extent.  One  solution 
consists  in  padding  the  signal  with  zeros  until  it  becomes  a 
recta. igle.  But  this  method  can  become  very  inefficient  since  it 
enlarges  the  size  of  the  DFT. 
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An  interesting  application  which  nay  potentially  benefit  from 
the  results  presented  in  this  thesis  is  the  area  of  number  theoretic 
transforms  (NTTs) .  The  NTT  has  a  structure  similar  to  the  OFT,  but 
with  complex  exponential  roofs  of  unity  replaced  by  integer  roots  and 
all  operations  are  defined  modulo  an  integer.  In  the  NTT  it  is  the 
signal  itself  which  is  treated  in  a  number  theoretic  way.  The 
lattice  structure  could  become  an  ideal  structure  for  handling  a 
multidimensional  signal.  Our  approach  can  then  be  used  to  develop 
fast  algorithms  for  evaluating  multidimensional  NTTs. 
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APPENDIX 


THE  MPFA  ALGORITHM 


THIS  MAIN  ROUTINE  TESTS  THE  MATRIX  PRIME  FACTOR  ALGORITHM 
UNI  IS  A  RANDOM  NUMBER  GENERATOR 


DIMENSION  X(0: 104 ,0: 34) , 1(0: 104,0:34) ,NN(4,2) ,N( 1,2) 
INTEGER  BUFF(8) ,BUFF2(8) 

INTEGER  U(2,2) 

DATA  N,NN/2,6,l,2,t,l,3,2,l,l/ 

DATA  NSIGl,NSlG2,NSIG3,M/2,6,0,2/ 

DATA  U/l ,0,0,1/ 

CALL  FOPEN( 20 , “MOUR" ) 

DO  11  I-0.NSIG1-1 
DO  12  J-0.NSIG2-1 
X(I,J)«fUNI(l) 

T(I,J)-0.0 
12  CONTINUE 
11  CONTINUE 

CALLFIN(X,Y,N,NNtM,NSIGl,NSIC2,NSIG3,U) 

URITE(20, 100)  X,T 
100  FORMAT(E12.4) 

CALL  FCL0SE(20) 

STOP 
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SUBROUTINE  FIN(X, Y,N,NN,M,NSIG1 ,NSIC2 ,NSIC3,UI) 

THIS  SUBROUTINE  IS  A  MATRIX  PRIME  FACTOR  FFT  ALGORITHM 

X  AND  Y  ARE  THE  REAL  AND  IMAGINARY  PARTS  OF  THE  INPUT  DATA 
ON  OUTPUT,  X  AND  Y  CONTAIN  THE  COMPLEX  SPECTRUM 
THE  INPUT  DATA  SHOULD  BE  ORGANIZED  AS  AN  NSIGl  BY  NSIC2  ARRAY 
THE  OUTPUT  IS  IN  ORDER  ,I.E  IT  IS  ORGANIZED  IN  THE  SAME  MANNER 
AS  THE  INPUT. 

UI  IS  THE  INVERSE  OF  THE  INPUT  INDEXING  MATRIX  U. 

M  IS  THE  NUMBER  OF  FACTORS  OF  N.  THESE  FACTORS  ARE  CONTAINED 
IN  THE  ARRAY  NN.  THEY  SHOULD  BE  RELATIVELY  PRIME  TO  EACH  OTHER 
N(l,t)-NN(I,l)'NN(2,l)*...*NN(M,l) 

DIMENSION  IT( 1,2) ,IL( 1,2) , IR( 1 ,2) , IN( 16,2) ,IM(0:255 ,2) 

INTEGER  N(l,2),UI(2,2) ,NUM1 (  16) ,NUM2( 16) , JR( 16,2) 

DIMENSION  X(0:35,0:37),Y(0:35,0:37) ,IP(1 ,2),NN(4,2) 

INTEGER  TNI ,TN2,UtNl ,U2NI ,UIN2,U2N2,U1TN1 ,U2TNl ,U1TN2,U2TN2 

ARRAY  IM  CONTAINS  THE  ADDRESSES  OF  THE  INPUT  POINTS  TO  THE 
N1  BY  TNI  SMALL  DFT  MODULE  .  ARRAY  JR  CONTAINS  THE  ADDRESSES 
OF  THE  OUTPUT  POINTS  TO  THE  SMALL  DFT  OF  LENGTH  NI  (WFTA  OF 
LENGTH  Nl)  WHERE  Nl  IS  ONE  FACTOR  OF  N. 

WFTA  ARE  THE  VERY  EFFICIENT  SMALL  LENGTH  DFT  DEVELOPED  BY 
WI NOG RAD. THERE  ARE  WFTA  OF  LENGTHS  2,3,4,5,7,8,9,16,(11,12,13) 

DATA  C31.C32  /0.*6602540,0.5/ 

DATA  C51.C52.C53  /0.95105652, 1.5388418,0. 36327126/ 

DATA  C54.C55  /0. 55901699 ,-l .25/ 

DATA  C71 ,C72,C73  /-l. 16666667,0.79015647,0.055854267/ 

DATA  C74,C75,C76  /O. 7343022, 0.44095855, 0.34087293/ 

DATA  C77.C78  /0.53396936, 0.87484229/ 

DATA  C92,C93,C94  /0.93969262.-0. 1 7364818.0.76604444/ 

DATA  C96.C97.C98  /-O. 34202014,-0.98480775,-0. 64278761/ 

DO  200  Jl-l.M 
N1-NN(J1,1) 

TNI'*NN(J1,2) 

N2-N(l,l)/Nl 

TK2-N(l,2)/TN1 

UINI-UI(1,1)*N1 

U2N1-UI(2,1)*N1 

U 1N2“UI( 1 , l)*N2 

U2N2-UI(2,l)*N2 

U1TN1*UI( 1 ,2)*TNl 

U2TNI»UI(2,2)*TN1 

UITN2-UI(1,2)*TN2 

U2TN2-U I ( 2 , 2 ) *TN2 


COMPUTE  THE  OUTPUT  PERMUTATION  MAP 


,3-/5$ 
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Kl-0 

DO  21S  Il-l.Nl 
NUM1 ( II )-HOD(Kl ,N1 )+l 
213  K1-KI+N2 
Kl-0 

DO  216  Il-l.TNl 
NUM2(  1 1  )-MOD( K1 . TNI  Hi 
216  K1-K1+TN2 

DO  THE  INPUT  PERMUTATION 

IT(l,l)-0 
IT(l,2)-0 
DO  210  J2-1.TN2 
IL(l,l)-IT(l,l) 

U(1,2)-IT(1,2) 

DO  220  J3-1.N2 
IP(l.l)-mi.l) 

IP( 1 ,2)-IL( 1 ,2) 

J-l 

DO  230  J4-1.TN1 
IR<  1 , 1)-IP(  1,1) 

IR( 1 ,2)-IP( 1,2) 

DO  240  JS-l.Nl 

REDUCE  IM(J)  MODULO  THE  MATRIX  N 

339  IP(IR(1,1).LT.NSIG1)  CO  TO  338 
IR( 1 , 1 )-IR( I , l J-NSIGl 

IR( 1 ,2)”IR( 1 , 2)+NSIC3 
CO  TO  339 

338  IF(IR(l,l).CE.O)  CO  TO  340 
IR(l,l)-IR(l,l)+NSICl 
IR(1,2)-IR( 1 ,2)-NSIC3 
GO  TO  338 

340  IF(IR( 1 , 2).LT.NSIC2)  CO  TO  343 
IR(  1 ,2)-IR(  1 , 2J-NSIC2 

GO  TO  340 

343  IF(IR( 1 , 2) .GE.O)  GO  TO  342 
IR( 1 ,2)-IR( 1 , 2)+NSIC2 
CO  TO  343 

342  IM(J,l)-IR(l,l) 

IM(J,2)-IR( 1 ,2) 

J-J+l 

IF(J5.CT.l)  CO  TO  239 
IP( 1 , 1)-IR( 1,1) 

IP( 1 ,2)-IR( 1,2) 

239  IR(1,1)-IR(1,1)+U1N2 


Ji'teo 


ooo  o  o  n a o  « 


240  IR(l,2)-IR(l,2)+UzN2 

IF( J4.GT. 1 )  GO  TO  229 
IL(1.1)-IP(1.1J 
IL(i,2)-IP(l,2) 

229  IP(1,1)-IP(1.1)+01T}I2 

230  IP(  1 ,2)-IP(  1 ,2)-HJ2TN2 

TO  MATRIX-P  DPT 


BOH  DFT'S 

IF(Nl.EQ.l)  GO  TO  205 

IS-1 

Ml—  Ml 

90  70  14-1, TNI 

Ml-Ml+Nl 

K-Ml+1 

IN( 1 , 1)-IM(K, 1 ) 

IN( 1 ,2)-IK(K,2) 

JR(NUM1 ( 1 ),l)-IN(l,l) 

JR(NUM1( 1 ) ,2)-IN( 1,2) 

DO  80  15-2, N l 
K-M1+I5 

IN(I5, l)-IM(K, l) 

IN(I5,2)-IM(K,2) 

JR(NUM1(I5) , 1 )»1N(I5, 1) 

80  JR(NUMl(T.5)  ,2)-IN(I5,2) 

GO  TO  (990, 102,103, 104, 105,990, 107,990, 109)  SI 
70  CONTINUE 

COLUMN  DFT'S 

205  IS-2 

IF(TNl.EQ.l)  GO  TO  219 
DO  110  16-1, N1 
:N(l,l)-IM(I6,t) 

IN( 1 ,2)-lM( 16 ,2) 

JR(NUM2( 1 ), 1 )»IN( 1,1) 

JR(NUM2(1),2)»IN( 1,2) 

Ml-0 

DO  120  17-2, TNI 
Ml-Ml+Nl 

1N(I7, l)-IM(I6+Ml ,1) 

IN<I7,2)-IM(I64M1,2) 

JR(NUM2(I7) , 1)«IN(I7 ,1) 

JR(NUM2(I7) ,2)-lN(I7,2) 

120  CONTINUE 

GO  TO  (990,102,103,104,105,990,107,990,109)  TNI 
1 10  CONTINUE 
219  IF(J3.GT.l)  GO  TO  221 


uuu  u  a  u  u  u 
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IT(1.2)-1L(1,2) 

221  IL(1,1)-IL(1,1)+U1N1 

220  IL(1,2)-IL(1,2)+U2N1 
IT(1,1)-IT(1,1)-HJ1TN1 
210  IT(l,2)-IT(l,2)-HJ2TNl 
200  CONTINUE 
RETURN 
990  STOP 

WFTA  N-3 


103  Tl*»(X(IN(3,l),IN(3,2))-X(IN(2,l) ,IN(2,2)))*C31 
U2»(Y(IN(3, 1) ,IN(3,7))-Y(IN(2, 1) ,1N(2,2)))*C31 
R1-X(IN(3.1),IN(3,2))+X(IN(2,1),IN(2,2)) 
S1-Y(IN(3,1),IN(3,2))+Y(IN(2,1>,IN(2,2)) 

T2-X( IN( 1 , l ) , IN( 1 , 2) )-Rl *C32 
U2-Y(IN(l,l),IN(l,2))-Sl*C32  • 

X ( JR(1 , 1 ) , JR(1 , 2 ) )-X( IN( 1 , 1 ) , 1N(1 , 2 ) )+Rl 
Y(JR( 1,1) ,JR( 1 ,2))“Y(IN( 1,1), IN( 1 »2))+Sl 
X(JR(3, 1)  ,JR(3,2))«T2-HJ1 
X  (  JR(  2 , 1 ) ,  JR(  2 , 2  )  )-T2-U  l 
Y(JR(3,1),JR(3,2))«U2-T1 
Y(JR(2,l),JR(2,2))-U2+Tl 
GO  TO  (70,110)  IS 


WFTA  N-5 

105  R1“X(IN(2, l) ,IN(2 ,2))+X( IN( 5,  l) ,  111(5,2)) 
R2-X(IN(2,l),IN(2,2))-X(IN(5,l),IN(5,2)) 

51- Y(XN(2,l),IN(2,2))+Y(IN(5,l),IN(5,2)) 
S2_Y( JN(2, 1) , IN(2,2))-Y(IN(5, 1) , 1N(5,2)) 

52- Y(IN(2,1),IN(2,2))-Y(IN(5,1),IN(5,2)) 
R3-X(IN(3,1),IN(3,2))+X(IN(4,1),IN(4,2)) 
R4-X(IN(3,1),IN(3,2))-X(IN(4,1),IN(4,2)) 

53- Y(IN(3,1),IN(3,2))+Y(IN(4,1),IN(4,2)) 

S4-Y(XN(3,1),1N(3,2))-Y(IN(4,1),IN(4,2)) 

Tl-(R2+R4)*C51 

U1-(S2+S4)*C51 

R2-T1-R2*C52 

S2-U1-S2*C52 

R4«T1-R4*C53 

54- Ui-S4*C53 
T1-(R1-R3)*C54 
U1-(S1-S3)*C54 
T2-R1+R3 


02-SI+S3 

X(  JR( 1,1) 

Y(JR( 1,1) 

T2,;X(JR(  l 

U2-Y(JR(1 

R1-T2+T1 

R3-T2-T1 

SI-U2-HJ1 

S3-U2-U1 

X(JR(2 , 1 ) 

X(JR(5,1) 

Y(JR(2 , 1 ) 

Y(JR(5, 1 ) 

X(JR(3,1) 

X(JR(4,l) 

Y(JR(3,1) 

y(jr(4,i) 

GO  TO  (70 


,  JR( 1 ,2) )-X( 1N( 1 . 1 ) ,1N( 1 , 2) }+T2 
,  JR( 1 ,2))"Y(IN( 1,1 )  ,IN( 1 , 2) )+U2 
, 1 ) ,JR( 1 ,2))+T2*C55 
, 1 ) ,  JR( 1 , 2) )+U2*C55 


,JR<2,2)) 
,JR(5,2)) 
,  JR(2,2) ) 
,JR(5,2)) 
,JR(3,2)) 
,JR(4,2)) 
,JR(3,2))] 
,JR(4,2)) 
,110)  IS 


-R1+S4 

-R1-S4 

-SI-R4 

-S1+R4 

•R3-S2 

-R3+S2 

-S3+R2 

-S3-R2 


WFTA  N-2 

T1“X(IN( 1 , 1 ) , 1N( 1,2)) 

X(IN(  1 , 1)  ,  IN(  1 ,2))“Tl+X( IN(2,1),IN(2,2)) 
X(IN(2 , 1 ) , IN( 2 ,2) )“T!-X( IN( 2 , 1 ) , 1N(2 ,2) ) 
T1"Y<IN( 1 , 1 ) , IN( 1,2)) 

Y( IK(1 , l ) , IN( 1 , 2 ) )-Ti+Y< INC  2 , 1 ) , IN( 2 , 2) ) 
Y(IN(2 , 1)  ,IN(2 ,2*)  )”T1-Y(IN(2,1),IN(2,2)) 
GO  TO  (70,110)  IS 


WFTA  N-4 

R1-X(IN(1, 

R2-X(IN(1, 

51- Y(IN(1, 

52- Y(IN(1, 
R3-X(IN(2, 
R4«X(IN(2, 

53- Y(IN(2, 

54- Y(IN(2, 
X(JR(1,1), 
X(JR(3, 1) , 
Y(JR(  1 , 1)  i 
Y(JR(3, 1 ) , 
X(JR(2 , 1 )  , 
X(JR(4 , 1) , 
Y(JR(2,1) | 
Y(JR(4 , 1) , 
GO  TO  (70, 

WFTA  N-7 


,1),IN(1,2))+XIIN(3, 
, 1 ) ,IN( 1 , 2) )-X(IN(3, 
, 1) ,IN( 1 ,2) )+Y( IN( 3, 
,  1) , IN( 1 ,2))-Y(IN(3 1 
,1),IN(2,2))+X(IN(4, 
, 1) ,IN(2 ,2) )-X(IN(4 , 
, 1) , IN(2 ,2) )+Y(IN(4 , 
,1),IN(2,2))-Y(IN(4, 
,JR(1,2))-R1+R3 
,JR(3,2))-K1-R3 
,JR(1,2))-S1+S3 
,JR(3,2))-S1-S3 
, JR(2 ,2) )"R2+S4 
,JF(4,2))-R2-S4 
,JR(2,2))-S2-R4 
,JR(4,2))-S2+R4 
,110)  IS 


l),IN(3,2)> 
1) ,IN(3,2)) 
1) ,IN(3,2)) 
1) ,IK(3,2)) 
1),IN(4,2)) 
1),IN(4,2)) 
1) ,IH(4,2)) 
l),IN(4,2)) 


«3~/fc3 
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107  R1-X(IN(2,1),IN(2,2))+X(IN(7,1),IN(7,2)) 
R2-X(IN<2,1),IN(2,2))-XUN(7,1>,IN(7,2» 
S1-Y(IN<2,1),IN<2,2))+Y(IN<7,1),IN(7,2)) 
S2“Y(IN(2 , 1 ) , IN( 2,2) )-Y( XN(7,1),IN(7,2)) 
R3"X(IN(3, 1) , IN(3 ,2))+X( IN(6 , 1 ) , IN(6, 2)) 
R4»X(IN(3,1) ,IN(3,2))-X(IN(6, 1) ,IN(6,2)) 
S>Y(ZN0,1),IN0.2)HY(III(*.1),IN(6,2)> 
S4“Y(IK(3, 1) ,IN(3,2))-Y( IN(6,1),IN(6,2)) 
R5“X(IN(4,1),IN(4,2))+X(IN(5,)),IN(5,2)) 
R6*X(IN(4, 1) ,IN(4,2)'-X( IN(5,1),IN(5,2)) 
S5“Y(IN(4,1) ,IH(4 ,2) )+Y( XN( 5 , 1 ) ,IN( 5,2) ) 
S6"Y(IN(4, 1 ) , IN(4 ,2))-Y(IN( 5, 1),IN(5,2)) 
T1-R1+R3+R5 
U1-S1+S3+S5 

X( JR(  1 , 1 ) , JR(l , 2)  )-X<  IN(1 , l)  ,  1 , 2) )+Tl 

Y(JR(  1 , l) , JR< 1 ,2) )-Y( IN( 1 , 1 ) , 1N( 1 , 2) )+Ul 
T1-X(JR( I,1),JR( 1 ,2) )+C7 1*T1 
Ul-Y(JR(  1,1)  ,JR<  1 ,2))+C7  1*01 
T2“C72*(R1-R5) 

02-C72*(Sl-S5) 

T3-C73*(R5-R3) 

U3-C73*(S5-S3) 

T4-C74*(R3-R1> 

U4-C74*(S3-Sl) 

R1-T1+T2+T3 

R3“T1-T2-T4 

R5-T1-T3+T4 

51- Ul+02+03 

53- U1-U2-U4 

55- 01-U3+U4 
U1“C75*(S2+S4-S6) 

T 1 "C7  5* ( R2+R4-R6) 

T2-C76*(R2+R6) 

U2-C76*(S2+S6) 

T3-C77*(R4+R6) 

U3-C77*(S4+S6) 

T4-C78*(R4-R2) 

U4-C78MS4-S2) 

R2-TI+T2+T3 

R4-T1-T2-T4 

R6-T1-T3+T4 

52- 01+U2+U3 

54- U1-U2-U4 

56- 01-U3+U4 

X(JR(2, 1),JR(2,2))-R1+S2 
X( JR(7 , 1) ,JR(7 ,2) )»R1-S2 
Y(JR(2 , 1 ) ,JR(2 ,2) )“S1-R2 
Y(JR(7,l),JR(7,2))-Sl+R2 
X(JR(3,1),JR(3,2))-R3+S4 
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X(  JR(6 ,  1 ) ,  JR(  6 ,2) )-R3-S4 
Y(JR(3,1),JR(3,2))-S3-R4 
Y(JR(6,l)6JR(6,2))“S3+R4 
X( JR(4 , 1 ) , JR(4 , 2) J-R5-S6 
X(JR(5,l),JR(5,2))-R5+S6 
Y(JR(4 , 1 )  ,JR(4 ,2))“S5''R6 
Y(JR(5, i ) , JR( 5 , 2) >-S5-R6 
GO  TO  (70,110)  IS 

VFTA  N-9 

R1-X(IN(2, 1 ) ,IN<2 , 2) }+X(IN(9 , 
R2«X< IN( 2 , 1 ) , IN( 2 , 2 ) )-X( IN(9 , 

51- Y(IN(2,1),IN(2,2))+Y(IN(9, 

52- Y(1N(2,1),IN(2,2))-Y(IN(9, 
R3-X(IN(3,1),IN(3,2))+X(IN(8, 
R4»X(1N(3, 1 ) , IN(3 , 2) )-X(IN(8, 

53- Y(IM(3,l),IN(3,2))+Y(tN(8, 

54- Y( IM( 3, 1 ) , IN(3 , 2 ) )-Y( 1N(8 , 
R5-X(IN(4,1),IN(4,2))+X(IN(7, 
T— <X(IN(4,1),IN(4,2))-X(IN(7 

55- Y(1N(4,1),IN(4,2))+Y(1N(7, 
U— (Y(IN(4,1),IN(4,2))-Y(IN(7 
R7-X(IN(5,1),IH(5,2))+X(IN(6, 
R8-X( IN( 5 , 1 ) , INC  5 , 2 ) )-X( 1N( 6 , 
S7“Y(IN(5, 1) , IN(5,2))+Y( IN(6, 

58- Y(IN(5,l).IN(5f2))-Y(lH(6, 
R9-X(Ur(l,l),IN(l,2))+R5 

59- Y(IM(l,l),IN(l,2))+S5 
T1-X(IM(1 , 1 ) ,  INO ,  2)  )-R5*C32 
0 1-Y( IH( l , 1 ) , IN(l , 2) )-S5*C32 
T2-(R3-R7)*C92 
U2-(S3-S7)*C92 
T3-(R1-R7)*C93 
U3-(S1-S7)*C93 
T4“(R1-R3)*C94 
U4-(S1-S3)*C94 
R10-R1+R3+R7 
S10-S1+S3+S7 
R1-T1+T2+T4 

R3-T1-T2-T3 

R7-T1+T3-T4 

S1-U1+U2+U4 

S3-U1-U2-U3 

S7-U1+U3-U4 

X(JR( 1 , 1) ,JR( 1 ,2))-R9+R10 
Y( JR( 1 , 1 ) , JR( l ,2) )“S9+S10 
R5"R9-R10*C32 
S5«S9-S10*C32 
R6"-( R2-R4+R8 ) *C  3 1 

56-  ($2-S4+S8)*C31 
T2-(R4+R8)«C96 
U2-(S4+S8)*C96 
T3-(R2-R8)*C97 


l',IN(9, 
1) ,IN(9 , 
1) ,IN(9  , 
1) ,IN(9 , 
1) ,IN(8, 
1),IN(8, 
1) ,IN(8, 
1),1N(8, 
1 ) , IN(7 , 
, 1 > ,IN(7 
1).IN(7, 
,1),IN(7 
1) ,IN(6, 
1) , IN(6, 
1) ,IN(6 , 
1) ,IN(6, 


2>) 

2)) 

2)) 

2)) 

2)) 

2)) 

2)) 

2)) 

2)) 

,2)))*C31 

2)) 

,2)))*C31 

2)) 

2)) 

2)) 

2)) 
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03-<S2-S8)*C97 

T4»( R2+R4 )  *C9  8 

U4-(S2+S4)*C98 

R2-T+T2+T4 

R4-T-T2-T3 

R8-T+T3-T4 

S2-U-HJ2+U4 

S4-U-U2-U3 

S8-U+U3-U4 

X(JR(2, 1),JR(2,2))« 

X(JR(9  1 1 )  ,JR(9 ,2)  )« 

Y(JR(2, 1)  ,JR(2,2) )« 

Y(JR(9 , 1),JR(9 ,2))* 

XCJR(3,1),JR(3,2))- 

X(  JR(8 , 1 ) ,  JR(8 , 2)  )* 

Y(JR<3,1),JR<3,2)). 

Y(JR(8,1).JR(8,2))« 

X(JR<4,1),JR<4,2)). 

X(JR(7 , 1 ) , JR(7 ,2) )■ 

Y(JR(4,1),JR(4,2))- 

Y(JR(7 , 1 ) , JR(7 ,2 ))■ 

X(JR(5,l),JR(5,2))- 

X(JR(6,1),JR(6,2))- 

Y(JR(5, 1 ) ,JR(5,2) )* 

Y(JR(6, 1) , JR(6,2))- 

GO  TO  (70,110)  IS 
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SUMMARY 


In  evaluating  the  multidimensional  discrete  Fourier  transforms 
(DFTs)  or  circular  convolutions  for  rectangularly  periodic  sequences,  it 
has  been  very  common  to  apply  one-dimensional  algorithms,  and  its  appli¬ 
cation  is  very  straightforward.  However,  it  becomes  cumbersome  for 
periodic  sequences  other  than  the  rectangularly  periodic  ones. 

In  this  paper,  a  general  method  of  applying  one-dimensional 
algoritnms  to  the  general  multidimensional  case  is  presented  for  the 
evaluation  of  circular  convolutions  and  OFT*.  This  method  stems  from 
the  decomposition  of  the  periodicity  matrix  for  an  arbitrarily  periodic 
sequence  which  gives  the  new  coordinate  system  on  which  the  sequence 
can  be  viewed  rectangularly  periodic.  This  paper  also  presents  how  the 
method  is  related  to  the  Winograd  Fourier  transform  algorithm  as  a 
special  case. 
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CHAPTER  I 

INTRODUCTION 

As  digital  signal  processing  techniques  have  become  popular, 
many  algorithms  have  been  developed  to  compute  discrete  Fourier  trans¬ 
forms  (DFTs)  and  circular  convolutions  which  have  reduced  the  computa¬ 
tional  complexity  of  these  calculations.  In  the  multidimensional  case, 
an  alternative  way  of  reducing  the  computational  complexity  Is  to  reduce 
Che  number  of  samples  to  represent  the  original  signal.  This  can  be 
achieved  with  a  proper  sampling  scheme. 

The  most  commonly  encountered  sampling  scheme  is  rectangular 
sampling.  However,  it  cannot  be  considered  as  an  optimal  sampling 
scheme  on  all  occasions.  For  example,  for  circularly  band-limited  sig¬ 
nals  hexagonal  sampling  requires  13.41  fewer  samples  than  rectangular 
sampling  (1].  One  of  the  reasons  that  other  sampling  schemes  are  less 
well  appreciated  chan  rectangular  sampling  is  that  for  rectangularly 
sampled  signals,  one-dimensional  algorithms  can  be  generalised  straight¬ 
forwardly  in  evaluating  DFTs  or  circular  convolutions,  while  for  the 
signals  sampled  with  other  schemes,  it  is  much  more  coapllcated. 

In  this  research,  a  unified  treatment  was  done  to  compute  DFTs  and 
circular  convolutions  for  general  multidimensional  signals.  The  main 
idea  is  that  after  changing  the  form  of  the  DFTs  or  circular  convolutions 
with  non-diagonal  periodicity  matrices  into  a  fora  using  diagonalized 
periodicity  matrices,  we  can  apply  one-dimensional  algorithms  directly. 
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This  vu  done  by  decomposing  the  periodicity  matrix. 

Tbla  papar  ia  dividad  Into  £iva  chapters.  Chapter  II  presents  a 
literature  survey  and  background  study.  The  first  subsection  of  back¬ 
ground  is  primarily  concerned  with  definitions,  especially  the  definition 
of  the  periodicity  matrix.  In  cha  next  subsection,  the  decomposition  pro¬ 
cedure  is  introduced.  Chapter  III,  which  is  the  main  chapter,  is  devoted 
to  the  mathematical  derivation  of  generalizing  algorithms  for  circular 
convolutions  and  the  DFTs  using  the  decomposition  technique.  With  the 
method  derived  in  Chapter  111  any  one-dimensional 'algorithms  can  be 
applied  to  the  general  multidimensional  ease  in  evaluating  the  DFTs  or 
circular  convolutions.  In  Chapter  IV,  the  efficiency  of  the  new  method 
is  discussed  when  it  is  applied  to  the  Winograd  Fourier  tra.-.sform 
algorithm  (WFTA)  [3]  for  the  evaluation  of  the  general  multidimensional 
DFTs.  Finally,  the  conclusions  are  presented  in  Chapter  V. 


CHAPTER  II 


LITERATURE  SURVEY  AND  BACKGROUND 


2.1  Literature  Survey 


The  necessity  for  developing  fast  and  efficient  convolution  and 

DFT  algorithms  stems  from  the  fact  that  the  direct  computation  of  length-N 

2 

convolutions  end  DFTs  requires  a  number  of  operations  proportional  to  N 
which  becomes  rapidly  excessive  for  large  dimensions. 

One  of  the  most  important  algorithms  for  computing  one-dimensional 
DFTs  is  the  fast  Fourier  transform  algorlth-n  (FFT)  [8]  introduced  by 
Cooley  and  Tukey  in  196S,  which  computes  a  one-dimensional  N-point  DFT 
with  the  number  of  operations  proportional  to  NlogjN,  when  N  is  a  power 
of  2.  This  reduces  drastically  the  computational  complexity  for  large 
transforms.  Since  convolutions  can  be  computed  by  DFTs.  the  FFT  algorithm 
can  also  be  used  to  compute  convolutions  with  a  number  of  operations 
proportional  to  NlogjN  and  has  therefore  played  a  key  role  in  digital  sig¬ 
nal  processing  ever  since  its  introduction.  Recently,  many  new  efficient 
convolution  (2,7,12,16]  and  DFT  techniques  [3,4,5,6,8,9,10,16]  have  been 
Introduced  to  decrease  the  computational  complexity.  Perhaps  the  most 
important  of  these  algorithms  are  the  Wlnograd  Fourier  transform  algorithm 
(WFTA)  [3]  Introduced  in  1978  and  a  circular  convolution  algorithm  pre¬ 
sented  by  Agarwal  and  Cooley  in  1977  [2],  The  former  achieves  a  theoreti¬ 
cal  reduction  of  computational  complexity  over  the  FFT  by  a  method  which 
can  be  viewed  as  the  converse  of  the  FFT,  since  it  computes  a  DFT  as  a 
convolution.  The  Agarwal-Coolcy  algorithm  is  considered  as  a  major 
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breakthrough  for  the  computation  of  large  convolutions.  It  converto 
one-dimensional  convolutions  Into  multidimensional  convolutions  by 
applying  Good ' a  napping  method  [11]  and  achieves  good  efficiency. 

Many  of  the  algorithms  introduced  above,  however,  were  originally 
designed  for  one-dlaenaional  cases,  and  have  been  used  to  compute  the 
multidimensional  OFT  and/or  circular  convolutions  of  rectangularly 
sampled  signals.  In  the  multidimensional  case,  we  can  think  of  two  ways 
to  reduce  the  computational  complexity.  The  first  one  is  to  develop 
more  efficient-algorithms  for  the  multidimensional  case  such  as  the  one 
developed  by  Nussbaumer  and  Quandalle  [4].  Their  algorithm  is  more 
efficient  in  computing  DFTs  than  the  WFTA  if  the  size  of  the  sampled 
signal  is  NxNx. . .xN.  The  second  way  is  to  reduce  the  number  of  points 
Itself  to  represent  the  original  signal  by  using  a  proper  sampling  scheme. 
The  most  commonly  encountered  sampling  scheme  is  rectangular  sampling. 
Algorithms  for  processing  rectangularly  sampled  signals  can  be  straight¬ 
forwardly  generalized  from  the  one-dimensional  case.  Peterson  and 
Middleton  [17],  however,  showed  in  1962  that  rectangular  sampling  is  a 
special  case  of  a  mure  general  sampling  strategy,  and  also  showed  that 
hexagonal  sampling  is  the  optimal  sampling  scheme  for  signals  which  are 
band-limited  over  a  circular  region  of  the  Fourier  plane,  in  the  sense 
that  exact  reconstruction  of  the  wave  form  requires  a  lower  sampling 
density  than  with  alternative  schemes.  For  such  signals  hexagonal 
sampling  requires  13. 4X  fever  samples  than  rectangular  sampling.  However, 
it  is  no  longer  true  that  one-dimensional  or  multidimensional  algorithms 
can  be  applied  straightforwardly  in  processing  generally  sampled  signals. 
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The  first  unified  treatment  for  auch  signals  was  done  by 
Marsereau  and  Speake  [9]  in  1981.  They  generalized  the  Cooley  and 
Tukey  FFT  algorithm  for  the  general  multidimensional  case  and  achieved 
a  significant  reduction  in  computation.  However,  their  approach  is 
restricted  to  certain  algorithms  and  hence  its  application  range  is 
somewhat  restricted.  In  this  paper  a  more  general  treatment  is  pre¬ 
sented  for  the  computation  of  DFTs  and  circular  convolutions. 

2.2  Background 

This  section  is  divided  into  three  subsections.  In  the  first  sub¬ 
section  the  general  idea  of  a  periodic  extension  of  a  multidimensional 
sequence  and  its  usage  is  introduced.  In  the  second  subsection  general 
forms  of  circular  convolution  and  the  DFT  are  derived.  These  two  sub¬ 
sections  are  strictly  based  on  Mersereau's  work  (1}.  In  the  last 
subsection,  a  decomposition  method  [15]  for  integer  matrices  is 
illustrated  explicitly  and  plays  a  key  role  in  Chapter  III. 

2.2.1  Periodicity  of  Multidimensional  Sequences 

A  two-dimensional  sequence  xfo^.nj)  is  rectangularly  periodic  if 

xfnj.nj)  «  xfnj  +  J^.nj)  (2.1) 

-  xfo^.nj  +  Hj) 

for  all  (n^.nj).  The  numbers  and  Nj  are  positive  integers.  If  they 
are  the  smallest  possible  positive  integers  for  which  equation  (2.1)  holds. 
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they  are  called  the  horizontal  and  vertical  perloda  of  x.  Any  periodic 
array  vith  horizontal  and  vertical  periods  Kj  and  1»2  is  completely 
specified  by  NjNj  independent  sas^les.  For  exasple  let  us  consider 
Figure  2.1. 

Figure  2.1(a)  shows  a  rectangularly  periodic  two-dimensional 
sequence  with  horizontal  period  -  5  and  vertical  period  N2  »  6.  Figure 
2.1(b)  shows  the  fundamental  period  of  that  sequence.  It  is  obvious 
that  every  sample  in  the  sequence  is  equal  to  one  cf  the  samples  in  the 
fundamental  period,  the  region  0  <  nj  <  llj  ■  1,  0  (  nj  <  «2  •  1. 

It  can-  be  seen  that  the  rectangularly  periodic  sequence  is  a 
special  case  of  generally  periodic  sequences.  In  the  multidimensional 
case,  an  M-dlmensional  sequence  x(it)  is  said  to  be  periodic  with  period 
N  if 


x(n)  -  x(n  +  Nr)  (2.2) 

for  all  integer  vectors  in  and  r  and  some  MxM  integer  matrix  N  whose 
determinant  is  nonzero.  Such  a  sequence  repeats  itself  In  the  M  dif¬ 
ferent  directions  which  are  defined  by  the  column  vectors  of  N.  For 
this  reason,  N  is  called  the  "periodicity  matrix"  of  the  sequence. 

Uhlle  there  is  no  unique  shape  to  the  set  of  samples  comprising  one 
period  of  a  periodic  sequence,  the  number  of  samples  in  any  period  is 
|det  N|,  which  must  be  an  integer  since  N  is  an  integer  matrix.  Th« 
most  commonly  encountered  periodic  sequences  are  those  for  vhlch  N  is 
diagonal.  Such  sequences  are  called  rectangularly  periodic.  For  the 
previous  example, 


But  sometimes  we  encounter  periodic  sequences  other  than  rectangularly 
periodic  sequences  such  as  the  one  shown  In  Figure  2.2.  This  sequence 
Is  sampled  on  a  hexagonal  raster.  The  periodicity  matrix  N  for  this 
sequence  Is 


Figure  2.2(b)  shows  a  fundamental  period  of  the  sequence  in  (a).  As 
mentioned  earlier,  the  shape  to  the  set  of  samples  comprising  one  period 
of  a  periodic  sequence  is  not  unique.  In  Figure  2.3(a),  we  show  the 
same  periodic  sequence  where  the  fundamental  periods  are  shaped  like 
parallelograms.  Any  sample  which  is  a  member  of  one  fundamental  period 
can  be  exchanged  for  the  corresponding  point  In  any  other  period  to  pro- 
duce  a  different  fundamental  period.  Thus  Che  choice  of  a  fundamental 
period  Is  not  unique.  This  Is  true  for  any  arbitrarily  periodic 
sequence.  The  availability  to  exchange  one  form  of  a  fundamental 
period  for  another  Is  useful.  For  exaeqtle,  let  us  take  Figure  2.2  and 
Figure  2.3.  When  discussing  symmetry  properties  of  th?  discrete  Fourier 
transform,  a  hexagonal  shape  for  the  fundamental  period  Is  helpful,  and 
when  confuting  the  DPT,  the  parallelogram  form  Is  helpful. 


(a)  The  Same  Periodic  Sequence  as  Shown  In  Figure  2.2 
Except  that  the  Parallelograms  are  Used  to  Represent 
the  Periods  of  the  Sequence. 

(b)  A  Fundamental  Period  In  the  Shape  of  a  Parallelogram 


2.2.2  General  Form  of  the  DFT  and  a  Circular  Convolution 

tec  ue  consider  a  periodic  sequence  x(n)  with  periodicity  matrix 
N.  For  such  a  sequence 


x(n)  ”  x(n  +  Nr)  (2.2) 

for  any  integer  vector  £.  Let  1^  denote  a  region  in  the  n-plane  which 
contains  one  period  of  this  sequence.  This  region  is  called  the  fundamen¬ 
tal  period. 

As  for  the  one-dimensional  case,  let  us  assure  that  x(n)  can  be 
uniquely  represented  as  a  finite  sum  of  harmonically  related  complex 
sinusoids.  Then,  with  prime  (')  indicating  transposition, 

x(n)  -  aOOexpfJk'R’nl  (2.3) 


where  k  is  an  Integer  vector  and  denotes  a  finite  region  in  the 
k-plane.  Since  the  sequence  x  is  periodic, 

x(n)  ”  x(n  +  Nr)  -  ^  a(k)exp( jk’R' (n  +  Nr)!  (2.4) 

k£JN 

»  ^  a (k) exp [ jk '  R 'Hr ] exp [ Jk ' R ' n ) 
kS 


Since  the  right  side  of  equation  (2.3)  and  equation  (2.4)  muse  be 
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equal  for  all  values  of  n. 


exp[jkVNrl  -  1 


(2. 51 


for  all  integer  vectors  k  and  r.  For  non-trivial  R’  and  N,  Eq.  (2.5) 
Implies  that 


R*S  -  2-rl 


(2.“>..i, 


or 


R*  -  2-S-1 


<2.6 -1) 


By  substituting  for  r'  and  letting  a(k)  »  r;  1~.,:  X(k),  we  obtain 

—  |det  Nj  — 


*  Jfiet" N |  £  *(k)expIJk'(2r»_l)B) 


<2. 7) 


kCj., 


Since  the  coaplex  exponentials  in  this  sum  are  periodic  in  both  n 
(periodicity  matrix  N)  and  k  (periodicity  matrix  s')  we  •■ee  that  at  most 
|det  s|  samples  of  X(k)  can  be  independent.  Thus  the  region  J„,  like 
Ijj,  contains  only  |det  N|  samples.  If  X(k)  is  defined  as 


X(k)  ■  ^  x(n)exp(-jk,(2_S”1)n] 
-eIN 


(2.8) 


wc  can  establish  the  existence  of  a  Fourier  series  relation  for  any 


periodic  sequence.  It  Is  straightforward  to  verify  that  Eq.  (2.7)  and 
Eq.  (2.8)  constitute  an  identity.  It  is  also  straightforward  to  establish 
the  uniqueness  of  Eq.  (2.8)  due  to  the  orthogonal it"  of  the  complex 
exponentials  exp[-Jk'  (2-'S”*)n,l  over  the  region  1^,  It  should  be  also 
noted  that  X(k)  is  periodic  with  periodicity  naf  ..  s’: 

X(k)  -  X(k  +  N  V)  (2.9) 

If  x(n)  is  a  finite-extent  sequence  with  support  confined  to  I,., 
we  can  use  tl.a  above  Fourier  series  relation  to  define  a  discrete 
Fourier  transform  OFT) 


X(k)  -  £  x (n) exp ( - jk * ( 2"N~l ).n J  (2.10) 


-ClX 


t(n)  -  i"— |  J  X(k)exptjk*(2-:S_l)n]  .  (2.11) 


Equation  (2.10)  Is  a  general  form  of  the  DFT.  Based  on  this  we  can 
establish  a  general  form  of  a  circular  convolution. 

Suppose  we  have  two  finite-extent  sequences.  x(n)  and  h(n),  with 
support  on  I,,  whose  DFTs  are  X(k)  and  H(k)  ,  respectively,  with  support  on 
Jjj.  Let  Y(k)  be  the  DFT  of  y(n)  formed  by 

Y(k)  -  H(k)X(k)  (2.12) 


and  let  us  determine  y(ii)  in  terms  of  x(n)  and  h(n). 
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We  shall  begin  by  considering  the  periodically  extended  sequence 
x,  h,  and  y  with  the  periodicity  Lvatrix  N,  and  X,  H,  and  Y  with  the 
periodicity  matrix  N1.  Since 


Y  (k)  -  H  ( k)  X  (k)  , 


by  applying  the  Inverse  discrete  Fourier  series,  we  obtain 


(2.13) 


y(n>  "  Jdet"Nj  £  H(k)X(k)expf  Jk’(2-N'1)nJ  .  (2.14) 


By  expressing  xOO  as 

X(k)  -  ^  x(n)exp[-jk ' (2',JJ~*)m]  , 

mcL. 

substituting  it  into  Eq.  (2.14),  and  rearranging  terms,  we  get 
y(n)  -  £  x(m)  £  H(k)expfjk’(2-N"1)(n  -  m)1 

ncly  ~  kcJy 


(2.15) 


y  x(m)h(n  -  m) 


(2.16) 


Since  y(n)  was  defined  to  be 


y(n)  - 


y(£)  for  £  c  1^ 


0  otherwise  , 


(2.17) 


3.-IV 
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we  can  write 

y(n)  •  ^  x (m) h (n  -  m)  for  n  c  1^  (2. 18. a) 

or  alternatively 

y(n)  -  £  x(m)h(((n  -  n))^)  (2.18.b) 

where  ((  ) );J  denotes  modulo  S  operation,  y  Is  said  to  be  the  circular 
convolution  of  h  and  x.  The  term  circular  convolution  Is  carried  over 
from  one-dinensional  signal  processing  terminology.  The  circular  convolu¬ 
tion  can  also  be  written  In  the  alternate  form  similar  to  the  one- 
dlmenslonal  case 


y(n)  *  ^  h(m)x(((n  -  •  (2.19) 

®£l}, 


2.2.3  Decomposition 

With  non-diagonal  periodicity  matrices,  it  Is  cumbersome  to  compute 
circular  convolutions  of  DFTs  with  one -dimensional  algorithms  or  some 
multidimensional  algorithms.  However,  If  these  general  forms  are  changed 
Into  the  forms  with  diagonal  periodicity  matrices,  it  becomes  straight¬ 
forward  to  compute  them  with  proper  algorithms.  This  can  be  achieved  by 
decomposing  the  periodicity  matrices.  Since  it  plays  a  key  role  <*s  will  be 
seen  In  Chapter  III,  it  will  be  Illustrated  explicitly.  The  method  to 
be  introduced  Is  done  by  Kaufman  (15). 


■  j*,'.*.  ■*m**sr *-"< if* ’t«vcairft&; -)*?/&{_#&%* : 
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Def  lnltlona 

Regular  unlaodular  tutrix:  A  regular  unimodular  matrix  Is  a 
square  matrix  whose  determinant  Is  1  or  -1. 

Subtraction  matrix:  An  Identity  matrix  for  which  one  zero 
replaced  by  any  real  number  is  called  an  elementary  subtraction  matrix. 
If  we  denote  an  elementary  subtraction  matrix  as  U  whose  1—  row 
and  j —  column  Is  (-'i)  as  shown  below 

fl  0  ol 


-l.J.t 


0  0 

1  -1 

0  1 


-1-2 


(2.20) 


then  It  has  the  following  properties:  If  a  matrix  A  is  premultlplled  by 
-i.j.a'  row  of  £  w111  be  re<luced  *>y  1  times  the  J—  row  of  A.  And 

postmultlpl lcat Ion  of  a  matrix  B  by  U.  reduces  the  J—  colunm  of  B 
by  a  times  the  1  column  of  B.  The  following  examples  show  these 
properties; 


1  0  0 

0  1  ~rx 

0  0  1 


11 

*12 

a13  ~ 

r  *u 

a12 

*13 

21 

a22 

a23 

■  a2l""',a31 

a22~ia32 

*23~ia 

31 

a32 

*33_ 

L  a3l 

a32 

*33 

(2.21) 


3 
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bn  bi2  bi3 

b21  b22  b23 

b31  b32  b33 

It  is  clear  that  elementary  subtraction  matrices  are  unimodular  matrices 
since  their  determinant  is  1  and  therefore  the  product  of  elementary 
subtraction  matrices  is  also  a  unimodular  matrix. 

Smith's  normal  form:  Any  matrix  which  contains  a  diagonal  sub¬ 
matrix  with  non-zero  diagonal  elements  and  whose  other  elements  are  all 
zeros  is  in  Smith's  normal  form  as  shown  below 


1  0  0 

0  1  -a 

0  0  1 


'll 

b12 

b13^b12 

21 

b22 

b23-ab22 

'31 

b32 

b33*°b32 

(2.22) 


with  d.  <  d,  <  d,  .  .  .  <  d  . 

12  3  r 

Procedure  for  the  decomposition 

Even  though  the  procedure  Is  applicable  to  any  real  matrices.  It 
will  be  restricted  to  square  Integer  matrices,  since  we  are  interested 
in  decomposing  periodicity  matrices.  The  goal  of  our  decomposition  is 
to  associate  an  m  x  m  matrix  N[  with  an  ra  x  m  diagonal  matrix  D  in  such 
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a  way  that 


N  ♦  V  •  D 
—toco  -tax  to  1  toco 


(2.24) 


where  U  and  V  are  regular  unlmodular  matrices. 

The  procedure  will  be  Illustrated  with  an  example  with 


(2.25) 


Step  1: 

Move  the  smallest  non-zero  element  in  absolute  value  In  N  to  the 
st  st 

1 —  row  and  1 —  column  with  proper  permutation  matrices  and  let  us  denote 
it  as  dj .  For  N  given  above  let  us  Cake  1  In  position  (2,1)  as  d^  and 
move  It  to  the  position  (1,1)  In  such  a  way  that 


P 

“21  ~ 


(2-26) 


Step  2: 


Construct  proper  subtraction  matrices,  and  V^,  to  replace  the 
(l,l)th  element  of  the  matrix  from  step  1  with  r.  .  for  1  «  2,3,. ...m  and 

If* 

the  (1,J)  element  of  the  matrix  with  r,  .  for  J  »  R,...,m  where  the 

J 

remainders  r^  ^  and  r^  ^  are  defined  by 
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*1J  "  aljdl  +  rlJ  * 
*11  *  “ildl  +  rll  ’ 


where  denote  the  element  of  the  new  matrix  and  are  integer  num¬ 
bers.  This  can  be  achieved  by  postmultiplications  with  subtraction 
matrices  V  and  preaultiplications  with  subtraction  matrices 

i.l.^lj 

-i,l,au*  the  renain<l«fs  and  r^  are  all  aero,  step  2  is  com¬ 
pleted.  If  not,  step  1  and  2  should  be  repeated  with  the  new  matrix 
until  the  remainders  become  zero.  For  the  given  exattple,  we  can 
determine  and  as  follows: 


"l 

o 

o 

I 

1 — 
o 

1 

1 _ 

h  * 

-3 

_0 

o 

9-  O 

1 _ 

and  * 

0  1  0 

_°  0  1_ 

(2.27) 


By  premultiplying  P21*N  with  U,  we  obtain 


«x  '  (I21*n) 


I 

© 

O 

1 _ 

~1  2  0~ 

1 

o 

-4 

1 _ 

-3  10 

3  1  2 

■ 

0-5  2 

1 

*-4 

o 

o 

_ 1 

_0  l  1_ 

- 1 

H 

H 

© 

.  we  get 


(2.28) 


JL-I% 


If  we  postmultlply  this  with  V 


Since  ve  have  all  the  remainders  equal  to  zero,  seep  2  Is  completed. 

Step  3: 

Upon  the  completion  of  step  2,  the  resulting  matrix  Is  In  the  form 
as  shown  below: 


dx  |  0  .  .  .  0 

0  J 

i 

i 

.  |  t-l^-lxm-l 


(2.30) 


0 


To  complete  the  decomposition,  repeat  step  1  and  step  2  for  the  sub- 
matrix  and  Nj  .  .  .  and  so  on  until  a  diagonal  matrix  is  obtained. 

Let  us  proceed  with  the  given  example.  If  we  consider  the  rlghthand 
side  of  Eq.  (2.29),  It  is  necessary  to  permute  the  matrix  so  that  the 
smallest  non-zero  element  of  the  submatrix  in  absolute  value  could  be 
placed  In  the  position  (2,2).  This  can  be  achieved  by  preimiltiplicatlon 
with  the  permutation  matrix  such  that 


z32  •«v(-2r£>)  *  ^i> 


2-W 


21 


10  0 

0  1  1  |  (2.31) 

0  -5  2 

Then  according  co  step  2,  subtraction  matrices  and  should  be 
determined.  In  the  case  they  are  as  follows 


0 

0 

H 

l _ 

1 

h-» 

O 

£2  - 

0  10 

and  V2  - 

0  1  -1 

_0  5  1_ 

1 

O 

O 

(2.32) 


By  premul tip llcat Ion  with  U,  and  tostmultlpllcation  with  V,,  we  get 


«2  *  QL32  '  *  <I21*S»  *  V 


“1 

0 

0“ 

“l 

0 

o" 

0 

0~ 

m 

0 

1 

0 

0 

1 

1 

- 

0 

1 

1 

0 

5 

1_ 

_0 

-5 

2_ 

_o 

0 

7_ 

•  ((ut  - 

<*21 

N)) 

•  vL)). 

^2 

“1 

0 

0” 

~1 

0 

0" 

”1 

0 

0~ 

m 

0 

1 

1 

0 

1 

-1 

m 

0 

1 

0 

_0 

0 

7_ 

_0 

0 

1_ 

_o 

0 

7_ 

(2.33) 


-  D 

(2.34) 

The  leftside  of  Eq.  (2.34)  can  be  rewritten  In  such  a  way  that 


jl  -m 


\ 
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%  *  <£32  *  <(-l  *  <—21  ‘-I55  *  -2  “  <— 2  *  -32  *  U1  '  -21*  *  -  '  <-l  *  -2* 

-O.S.V  (2.35) 

with  U  -  U2  *PJ2  •  •  P21  and  V  -  V  .  V2  .  Then 


a-m 
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CHAPTER  III 


CEJfERALIZATIOJf  PROCEDURE 


In  the  previous  chapter,  some  basic  background  was  Introduced 
which  might  be  necessary  to  follow  the  generalization  procedure  for  :he 
circular  convolution  and  l'FT  algorithms.  Based  on  chl3,  the  generaliza¬ 
tion  of  circular  convolution  algorithms  will  be  discussed  in  section 
3.1  below  and  the  generalization  of  DFT  algorithms  will  be  discussed 
in  section  3.2.  It  will  be  seen  that  the  decomposition  of  the 
periodicity  matrix  plays  a  key  role  in  both  sections. 


3.1  Generalization  of  Circular  Convolution  Algorithms 
In  the  multidimensional  case,  the  general  form  of  the  circular 
convolution  y  of  sequence  x  and  h  can  be  represented  as 


y(n)  *  X(*)h((m  -  m))N  ,  ntl^  (3.1) 

or  alternatively 

y(n)  -  ^  x((n  -  m)),,h(m)  ,  n  e  Ij,  .  (3.2) 

BEl^ 

where  H  denotes  the  periodicity  matrix  when  h  or  x  are  periodically 
extended,  1^  denotes  the  set  of  samples  In  one  period,  and  ((  ))^  denotes 
the  modulo  N  operation.  If  the  sequences  x  and  h  have  the  size 
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N  «  N^xN2*...xNd  and  the  periodicity  matrix  ^  Is  diagonal  such  that 


0  ...  0 


0  N, 


0  0  .  .  .  SV 


then  Eq.  (3.1)  can  be  written  in  the  following  form: 

•V1  V1  V1 

•  •  *  .Oj)  ■  ^  ^  ...  ^  x ( dj  .m., .  •  •  •  *nj) 

m^*0  n^-O  rij-0 


h<<nl  "  “l^s  *((n2  *  . ((nd  '  nd»N  >  (3‘3) 

1  2  d 


for  nj^  •  0,1,.  ...N11,  i  -  1,2 . d. 

It  is  straightforward  to  evaluate  Eq.  (3.3)  by  using  any  proper 
algorithms  for  one-dimensional  circular  convolutions.  For  example. 

In  the  two-dimensional  case,  Eq.  (3.3)  becomes 
N.-l  N2-l 

y(n1-"2)  *  ^  x(m1,m2)h(((n1  -  mj))^  ,((n2  -  m2))N  )  .  (3.4) 

m1-0  m2“0  1  2 

Equation  (3.4)  can  be  computed  as  a  circular  convolution  of  length  Nj  in 
which  each  scalar  multiplication  is  replaced  by  a  convolution  of  length 
Nj.  So  if  is  the  nunber  of  multiplications  required  to  compute  a 


£-3o  I 


2S 

convolution  of  length  S^>  Eq.  (3  4)  can  also  be  evaluated  with 
multiplications.  Similarly  Eq.  (3.3)  can  also  be  computed  with 
multiplications. 

But  if  the  periodicity  matrix  S  is  not  diagonal,  the  one- 
dimensional  algorithms  cannot  be  applied  directly.  To  apply  these 
algorithms  directly  some  modification  is  necessary. 

3.1.1  General  Approach 

There  may  be  many  ways  to  achieve  this,  but  one  way  to  be  dis'us- 
sed  is  to  change  the  general  form  of  equation  (Eq.  (3.1))  to  the  form  of 
Eq.  (3.3).  Then  it  can  be  treated  in  the  sane  way  as  for  Eq.  (3.4). 

This  modification  technique  stems  from  the  decomposition  of  the  periodic¬ 
ity  matrix  S.  As  will  be  seen  later  in  this  chapter,  it  Is  quite  general. 

First  let  us  consider  the  simple  two-dimensional  sequence  in 
Figure  3.1.  In  Figure  3.1,  tiie  region  of  support  (or  the  fundamental 
period,  shaded  region)  is  periodically  extended  according  to  the  periodic¬ 
ity  matrix 


based  on  the  axes  n^  and  nj* 

But  this  sequence  nay  also  be  viewed  as  rectangularly  periodic 

f  A 

based  on  the  new  axas  and  n2  with  a  diagonal  periodicity  matrix  N 
given  by 


a-£°Q 
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Thus  after  napping  each  point  In  the  old  coordinates  (n^,n2)  to  the  new 
coordinates  (n^.n^),  we  can  have  the  circular  convolution  form  of  Eq. 
(3.3).  But  the  new  coordinate  system  on  which  the  sequence  becomes 
rectangularly  periodic  is  not  unique  and  in  general  it  is  tedious  to 
find  these  coordinate  systems  by  ad  hoc  means.  One  general  method  is 
to  systematically  decompose  the  periodicity  matrix  S. 

As  was  shown  in  the  previous  chapter,  any  square  matrix  A  can  be 
associated  with  a  diagonal  matrix  D  in  such  a  manner  that 

U  A  V  -  D  ,  (3.5) 

where  U  and  V  are  regular  unimodular  matrices.  If  a  periodicity  matrix 
N  is  substituted  for  A  in  Eq.  (3.5),  we  have 

U  N  V  -  D  .  (3.6) 

Let  us  consider  S  •  first.  N  •  V  can  be  viewed  as  another  representation 
of  the  periodicity  matrix  S. 

This  can  be  illustrated  as  follows.  The  column  vectors  of  S 
represent  the  directions  of  periodicity.  Since  post-multiplication  by  V 
gives  a  new  matrix  whose  column  vectors  are  linear  combinations  of  the 
column  vectors  of  N ,  and  furthermore  since  V  is  a  unimodular  matrix, 
periodic  extensions  according  to  N  and  N*V  are  equivalent  and 


det(N  •  V)  «  det(N).  Hence,  if  we  denote  N  *  V  ■  M,  M  is  another 
representation  of  the  periodicity  matrix  N,  and  we  have 

0  •  M  -  D  .  (3.7) 

From  Eq.  (3.7),  if  we  consider  U  as  a  transfer  matrix  which  maps  each 
point  of  the  sequence,  then  the  column  vectors  of  0  ^  form  the  new 
coordinate  system  on  which  the  sequence  is  rectangularly  periodic  with 
the  diagonal  periodicity  matrix  D.  It  can  be  verified  as  follows. 

Let  l  form  the  i—  coordinate  of  the  new  coordinates  and  e^  the 
i—  coordinate  of  the  original  coordinates  whose  i— entry  is  1,  and  0 
elsewhere.  Then 

U  •  -  e4  .  (3.8) 

Since  det(U)  i  0,  Eq.  (3.8)  becomes 

ZL  -  U"1^  .  (3.9) 

And  it  is  obvious  that  e^'s  are  linearly  independent.  Thus  the  ^'s  are 
also  linearly  independent  and  the  i—  column  vector  of  U  *  forms  the  i— 
coordinate  of  the  new  coordinates.  Then  any  vector  x  in  the  space  can  be 
represented  as  a  weighted  sum  of  e^'s  or  _L^'s  such  that 


where  Che  a^'s  end  tt ' s  are  conacants  which  denote  Che  Indices  based  on 
Che  original  coordinates  and  the  new  coordinates,  respectively.  If  we 
premultlply  x  in  Eq.  (3.10)  by  the  transfer  matrix  U,  we  have 


«*-«(][  -a)  Mi) 

‘  I  bi<Mi> 


(3.11) 


It  is  easily  seen  from  Eq.  (3.11)  that  every  point  on  the  original 
coordinate  system  can  be  mapped  onto  the  new  coordinate  system  by  pre¬ 
multiplying  it  by  the  matrix  U. 

Now  let  us  consider  the  general  form  of  the  multidimensional 
circular  convolution  (Eq.  (3.2)) 


y(n)  -  £  x((n  -  k) )s  h(k). 


(3.2) 


Equation  (3.2)  can  be  changed  into  the  r.ew  coordinate  system  as  follows. 


y(((Un))D)  -  ^  x(((Un  -  UV))D)h(Uk), 


aE  h  * 


(3.12) 


Let  us  assign  new  variables  for  Un  and  Uk  such  chat 
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Then  Eq.  (3.12)  becomes 


y(m)  -  ^  x(((m  -  J:))D)h(l)  ,  m  E  Ip  , 


(3.13) 


where  denote  the  set  of  samples  in  one  period  corresponding  to  the  new 
periodicity  matrix  D.  Clearly  Eq.  (3.23)  is  in  the  form  of  a  circular 
convolution  for  a  rectangularly  periodically  extended  sequence.  Thus 
the  output  sequence  can  be  obtained  in  the  following  manner:  1.  Obtain 
the  transfer  matrix  U  by  decomposing  the  original  periodicity  matrix  N. 

2.  Map  every  point  in  one  period  onto  the  new  coordinates  by  multiplying 
the  transfer  matrix  U.  3.  Compute  the  circular  convolution  in  the  same 
way  as  for  the  rectangular  case.  4.  Map  the  result  back  onto  the  original 
coordinates.  Tn  the  following,  a  simple  exanple  will  be  presented  in 
order  to  illustrate  the  procedure. 

3.1.2  Example 

Let  us  consider  Figure  3.2.  This  figure  shows  the  periodic  exten¬ 
sion  of  the  shaded  region  (fundamental  period)  according  to  the  periodic¬ 


ity  matrix 


t-P  *1. 

LO  2_ 


If  we  decompose  N  with  the  procedure  Illustrated  in  the  previous  chapter » 


we  get 


JL-£01 


** «»M*^***WW\f4** v^v. •>-**'* 
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Since  the  rectangular  periodicity  matrix  was  defined  to  have 

[  1  0  1 

positive  diagonal  elements,  let  us  postmultiply  by  I  g  I  on  both 
sides.  Then 


As  mentioned  earlier  N*V^  is  just  another  representation  of  the  periodic¬ 
ity. 


N  «  V 


M 


To  find  the  new  coordinates,  let  us  take  the  inverse  of  U. 


1  0 

2  1_ 

Thus  we  have  (1,2)  as  the  new  n^  coordinate  which  is  denoted  as  n^  and 
(0,1)  as  the  new  Oj  coordinate  which  happens  to  be  the  same  as  the 


original  coordinate.  It  we  examine  Figure  3.2,  the  periodically  extended 
version  is  rectangularly  periodic  on  the  new  coordinates  (nj.n^)  with  the 
periodicity  matrix 


D  - 


Now  let 


Ij,  -  {(0,0).  (0,1),  (1,0),  (1.1)} 


and 


Ij,  -  {(0,0). (0,1), (0,2), (0.3)}  . 


Then  each  point  in  1^  is  mapped  into  the  new  coordinates  as  follows. 


(0,0) 

(0,0) 

(0,1) 

(0,1) 

X 

(1.0) 

■ 

(1.-2) 

(l.D 

(1,-1) 

If  we  take  the  modulo  D  operation  for  each  vector  on  the  right  side,  we 


(<0,0))D  -  (0.0) 
<(0.1))D  -  (0.1) 
(U.-2))D  -  (0.2) 


((l.-l))D  -  (0.3) 


Thus  each  point  in  1^  is  mapped  as  follows 


After  the  mapping  is  done,  the  procedure  for  the  evaluation  of  the 
circular  convolution  is  straightforward.  It  should  be  noted  that  the 
output  sequence  has  to  be  mapped  back  into  the  original  coordinate 
sy;.om  using  the  reverse  napping. 

3.2  General  Approach  to  the  Evaluation  of  the 
Multidimensional  DFT 

Any  M-dimenslonal  sequence  x(n)  with  the  periodicity  matrix  S  can 
be  exactly  represented  by  a  set  of  Fourier  series  coefficients  which  will 
be  denoted  by  X(k)  where 


*r  »./«•? 

r: 


£(n)  "  TdirNf  ]>  X(k)exp[JJc‘  (2rS-1)nJ  (3.14) 


X(k)  "  y  x(n)expf-jk'  (2irN-1)n] 

*£Si 


(3.15) 


The  sequence  of  coefficients  X(k)  is  periodic  with  the  periodicity  matrix 
s'  with  a  prime  (’)  indicating  the  operation  of  vector  or  matrix  trans¬ 
position.  The  regions  1^  and  denote  the  set  of  samples  in  one  period 
of  x(n)  and  X(k) ,  respectively.  If  x(n)  and  X(k)  are  defined  to  be 
sequences  with  finite  support  on  1^  and  respectively,  then 


x(n)  “  ^  *(*»  ♦  -Lq) 

q 

X(k)  -  ^  X(k  +  N’r) 

£ 

where  q  and  £  vary  over  all  M-d linens ional  Integer  vectors  and 


x(n)  , 

n  e  Si 

0 

otherwise 

X(k)  , 

k  c 

0 

otherwise  . 

Thus  the  relationship  between  x(n)  and  X(k)  can  be  obcalned: 


(3.16) 


(3.17) 


(3.18) 


(3.19) 


V 

:? 

*• 

A 
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Xflc)  -  ^  x(n)exp[-jk'(2TTN-1)n],  k  c 

— £lU 


(3.20) 


x(n)  -  ^  X(k)exPrjIt,(2i:s‘1)n].  ne  1^  (3.21) 


In  Che  remainder  of  che  section,  we  shall  consider  a  general 
approach  co  che  computation  of  che  multidimensional  DFT.  The  object  of 
the  approach  is  not  In  modifying  a  certain  DFT -algorithm,  but  in  looking 
it  possible  Co  evaluate  the  OFT  with  any  periodicity  matrix  by  using 
proper  existing  algorithms. 

3.2.1  The  General  Approach 

The  general  form  of  the  DFT  for  a  multidimensional  sequence  can 
be  represented  as  in  Eq.  (3.20).  When  the  periodicity  matrix  N  is 
diagonal,  which  is  the  most  commonly  encountered  case,  Eq.  (3.20)  can  be 
expressed  as 


N.-l  N  -1 

1  m 

X(kltk2, . . .  ,km)  -  ^  ...  ^  x(n2 . n^expf^sk^)  (3.22) 

v°  v° 

.  .  .  exp(-j2rk  n  ) 

tQ  m 


with  k^  »  0,l,...,Nj-l,  for  i  •  1,2,. ...m. 

It  Is  straightforward  to  compute  the  DFT  when  it  is  in  the  form  of 
Eq.  (3.22)  by  using  a  row-column  decomposition  with  a  one-dimensional  DFT 
algorithm  such  as  the  FFT  f 8 ]  or  WFTA  [3). 


<3-^3 


\ 
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If  the  periodicity  matrix  la  not  diagonal.  It  becomes  cumbersome 
to  compute  the  DFT  by  applying  the  existing  DFT  algorithms.  The  method 
to  be  introduced  stems  from  the  decomposition  of  the  periodicity  matrix  N 
for  the  case  for  the  general  form  of  circular  convolutions  and  obtains 
a  DFT  of  the  form  of  Eq.  (3.22)  from  Eq.  (3.21).  We  know  that  N  can  be 
decomposed  as 


V  N  V  -  D 


(3.6) 


where  U  and  V  are  regular  unimodular  matrices  and  D  is  a  diagonal  matrix. 
If  we  take  the  inverse  of  S',  we  have 


By  substituting  N-1  into  Eq.  (3.21),  we  have 

X(k)  -  y  x(n)exp(-jk’(2^V  D_1U)n) 

Eel,, 


-  V  x (n) exp (-)  (k  ’  V)  (2ttD-1)  (Un)  ) 


or 


£  x(n)exp(-j((k'v))D(27iD-1)((Un))D)  (3.23) 


-cIN 


for  k  c  JN. 


At  this  stage  it  is  necessary  to  show  that  ((Un))p  and  ((k'v))^ 
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give  distinct  vectors  for  n  e  1^  and  k  c  J^. 

In  the  previous  sub-section,  x(n)  and  X(k)  are  defined  to  be  the 
periodically  extended  versions  of  x(n)  and  X(k)  with  the  periodicity 
matrices  M  and  N ' ,  respectively.  However,  they  can  also  be  considered 
to  be  rectangularly  periodic  with  the  periodicity  matrix  D  In  the  new 
coordinates  which  are  composed  of  the  column  vectors  of  U_1  for  x(n)  and 
the  column  vectors  of  (V^)  ^  for  X(k).  The  former  was  proven  In  the 
previous  section;  the  latter  can  be  proven  similarly.  If  Eq.  (3.6)  Is 
transposed,  we  have 

v'  N'  u’  -  D’  -  0  (3.24) 

Thus  the  new  coordinates  for  X(k)  can  be  obtained  from  the  column  vectors 

i  -1 

of  (V  )  on  which  X(k)  Is  rectangularly  periodic.  This  means  that 
((Un))p  and  ((V  k))p  for  rj  e  Ijj  and  ]c  c  match  every  point  In  Ip  and 
Jjj,  respectively,  where  Ip  and  Jp  denote  the  set  of  samples  in  one  period 
of  x(n)  and  X(k)  corresponding  to  the  new  periodicity  matrix  D. 

With  some  changes  in  variables  in  Eq.  (3.23)  such  that 


"  I’  (3.25.a) 

((Un))D  “  H  •  (3.25.b) 


Eq.  (3.23)  becomes 
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X<Q',)"1I»;i  "  ^  *<<jfl“>Vxp(_J£’(2l,D"l)H>-  i  £  JD  (3-26) 

This  Is  in  the  fora  of  a  DFT  of  a  rectangularly  periodic  sequence.  Thus 
Eq.  (3.26)  can  be  computed  as  follows;  1.  Map  every  ooint  of  x(n)  into 
the  new  coordinates  by  premultiplying  the  cransfer  matrix  U.  2.  Compute 
the  DFT  in  the  same  manner  as  for  a  rectangularly  perludic  sequence. 

3.  Since  the  result  is  on  the  new  coordinates,  nap  the  result  back  into 
the  original  coordinates  to  obtain  X(k)  by  premultiplying  (V’)_1. 

In  the  following  subsection,  the  evaluation  of  a  two-dimensional 
DFT  will  be  introduced  to  help  illustrate  the  whole  procedure. 

3.2.2  Example 

Let  us  consider  Tlgure  3.2.  The  periodicity  matrix  of  the 
sequence  is 


And  the  regions  I,,  and  can  be  chosen  such  that 


IN  -  {(0,0),(0,l),(l,0),(l.n}  -  jn  . 

We  know  from  chi  previous  section  that 
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Thus  1^  and  Jp  can  be  chosen  such  that 

Ip  -  {(0,0). (0.1). (0.2), (0,3)}  -  JD  . 

Then  every  point  In  L.  Is  mapped  Into  1  as  follows, 

£*  u 


*3. 


(0.0) 

(0.0) 

i  1  o~j 

(0.1) 

(0.1) 

1 

X 

L-2 

(1.0) 

(1.-2) 

(l.i) 

(l.-l) 

If  we  take  the  modulo  D  operation  for  each  vector  in  the  right  side,  we 
have 


((0.0))D  - 

(0.0) 

((o.i))D  - 

(0.1) 

«l»-2»D  ■ 

(0,2) 

((l.-l))D  - 

(0,3) 

Thus  each  point  in  is  mapped  such  that 


'  fc; 


/ 
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coordinates,  k^  and  k2>  This  can  be  done  by  premultiplying  by 

(V*)"1  as  follows. 


(V*)'1 


n 

Li  oj 


(-1.0) 

(-2.0) 


(-3.0) 


If  we  take  the  modulo  n'  operation  for  each  term  on  the  right  side,  we 


((0,0))N.  -  (0,0) 
«-l,0))v,  -  (1,1) 

<(-2.0))N-  -  (0.1) 

((-3.0))N.  -  (1,0) 


Thus  each  point  in  is  mapped  into  in  such  a  manner  that 


This  mapping  is  the  last  step  in  the  procedure  for  obtaining  the  DFT  of 
sequences  with  non-diagonal  periodicity  matrices. 


<J-JQO  I 
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CHAPTER  IV 


COMPUTER  PROGRAMMING  THE  WFTA 


The  procedures  introduced  in  the  previous  chapter  are  very  general 
methods  for  computing  circular  convolutions  and  DFTs  for  multidimensional 
sequences.  Based  on  the  procedure,  a  computer  program  has  been  written 
using  the  WFTA  to  evaluate  general  multidimensional  DFTs.  Since  the 
procedure  for  the  evaluation  of  a  multidimensional  circular  convolution 
is  very  similar  to  the  one  for  a  DFT,  the  discussion  will  be  confined  to 
programing  Issues  associated  with  the  WFTA. 


4.1  Winograd  Fourier  Transform  Algorithm  (WFTA) 

With  the  WFTA,  a  composite  DFT  of  site  N,  where  N  is  the  product 
of  d  relatively  prime  factors  Nj.Nj. • • . .N^  is  mapped  into  a  multi¬ 
dimensional  DFT  of  size  NjXNjX. .  .xN^  using  the 


n  »  ^  (N/N^Jn^,  modulo  N,  n'»  0,1,...,N-1 

nA  -  0.1 . Nt-1 

d 

k  ■  y  (N/N^Jk^  modulo  N,  k  “  0,1,...,N-1 
1*1 

k1  •  0,1,...,N1-1 


index  mapping  scheme  Introduced  by  Cood  [11].  This  multidimensional  DFT 
is  then  expressed  in  the  form  of  a  nesting  of  d  different  one-dimensional 
small  DFTs.  For  example,  for  a  DFT  of  size  x  N2>  the  two-dimensional 


cP-<aa/ 


DFT  can  be  expressed  as 


Nj-1  M2-l 


XCkj.kj)  -  y  y  x(i»1,n2)exp  n^j  exp  |-J  jp  n2k2  )  (4.1) 

_A  _  1  2 


n^«0  n2«0 


After  the  terms  are  rearranged,  Eq.  (4.1)  becomes 

n2-i  Ni-1  v 

X(kx,k2)  -  y  /  y  x(n1(n2)exp  Jexp  n2k2J  (4.2) 

n2-0  \  n^O  1  / 


If  ue  denote 


Nrl 

(kltn2)  -  y  x(nltn2)exp  (-J  n^j  » 

nJ^O  1  ‘ 


(4.3) 


Then  Eq.  (4.2)  can  be  expressed  as 


V1 


X(*Vk2)  "  y  *(k1,n2)exp  |-j  ~  n2k2)  ' 


n2»0 


(4.4) 


Equation  (4.4)  is  a  DFT  of  length  N2  where  each  multiplication  step 
represents  a  DFT  of  length  In  which  each  multiplication  by 
CXP(~J  ^  niki^  has  been  replaced  with  a  multiplication  by 


2tt 


2tt 


exp(-j  jj-  njk^expf-J  —  n2k2) .  In  other  words,  Eq.  (4.4)  is  a  DFT  of 
length  Nj  nested  in  a  DFT  of  length  N2<  This  procedure  can  be  easily 


extended  to  higher  dimensions.  Thus  If  M  is  defined  to  he  the  total 
number  of  multiplications  necessary  In  evaluating  the  DFT  of  size  N  and 


for  the  DFT  of  size  N^, 

d 

-fk 

i-i 


4.2.  Computer  Program 

The  whole  program  is  divided  Into  two  phases:  a  generation  phase 
and  an  execution  phase  (Figure  4.1).  In  the  generation  phase,  some 
napping  vectors  (mapping  vectors  1,  2,  and  3)  and  coefficients  are 
computed  which  are  used  In  the  execution  phase  (Figure  4.2).  The 
decomposition  of  the  periodicity  matrix  also  takes  place  at  this  time. 

The  execution  phase  is  composed  of  five  parts,  as  shown  in  Figure  4. 2. a. 
Precomputed  elements  from  the  generation  phase  are  used  at  this  time. 
Mapping  vector  1  Is  used  in  step  1  to  map  the  points  in  into  the  new 
coordinates  so  that  the  general  form  of  the  DFT  can  be  changed  into  the 
form  of  a  DFT  for  a  rectangularly  periodic  sequence,  and  mapping  vector 
3  is  used  to  map  the  result  from  step  4  back  into  the  original  coordinates 
to  give  the  final  result.  For  a  given  rectangularly  periodic  sequence. 

If  the  periodicity  matrix  has  composite  diagonal  elements,  computational 
savings  result  by  mapping  this  sequence  Into  a  higher  dimension,  which 
takes  place  in  step  2  using  the  precomputed  mapping  vector  2.  Mapping 
vector  2  Is  also  employed  In  step  4  in  performing  the  Inverse  of  this 
mapping. 
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A  more  detailed  schematic  diagram  for  step  3  is  shown  in  Figure 

4.2. b.  Step  3  la  divided  into  three  parts:  input  addition,  multiplica¬ 
tion,  and  output  addition.  The  input  addition  part  gets  K  input  data 
points  and  produces  M  output  data  points  which  are  fed  to  the  multiplica¬ 
tion  part,  where  N  and  M  represent  the  total  number  of  points  In  the 
sequence  and  the  total  number  of  multiplications  needed,  respectively. 

The  data  are  multiplied  by  precomputed  coefficients,  then  the  results 
are  added  together. 

If  we  are  only  concerned  with  rectangularly  periodic  sequences, 
there  are  some  procedures  which  can  be  omitted;  the  decomposition  and 
the  computation  of  the  mapping  vectors  1  and  3  in  the  generation  phase, 
and  steps  1  and  S  in  the  execution  phase.  It  can  be  seen  from  Figure 

4. 2.  a  that  steps  2,  3,  and  4  represent  the  evaluation  of  the  DFTs  for 
rectangularly  periodic  sequences.  Thus  in  the  rectangular  case,  N  real 
memory  locations  are  required  for  the  mapping  vector,  while  in  the 
general  case,  it  appears  chat  3N  real  memory  locations  are  required. 
However,  mapping  vectors  1  and  2  can  be  combined  to  produce  a  new 
napping  vector  which.  In  turn,  results  In  combining  steps  1  and  2 

into  one  processor.  Slmllirly,  mapping  vectors  2  and  3  can  be  combined 
which.  In  turn,  results  in  combining  steps  4  and  S  (Figure  4.3).  There¬ 
fore,  with  a  little  more  additional  computation  in  the  gencratlo..  nhase, 
real  memory  locations  for  the  mapping  vectors  can  be  reduced  from  3N 
to  2N.  Furthermore,  by  combining  steps  1  and  2,  and  4  and  5,  the  execu¬ 
tion  time  and  the  program  length  of  the  execution  phase  can  be  reduced  to 
levels  which  are  comparable  to  the  rectangular  case.  In  Figure  4.3,  map¬ 
ping  vectors  l'  and  3*  result  from  the  combining  of  the  mapping  vectors 


Figure  4.3.  (b)  Block  Diagram  for  the  Execution  Phase  In  Figure  4.3. 
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1  and  2,  and  2  and  3  In  Figure  4.1,  and  pro.eaaor  1*  and  3'  are  the 
combining  of  ateps  1  and  2,  and  4  and  5,  tspectively,  in  Figure  4. 2. a. 

It  can  be  concluded  from  the  above  verification  that  the  cost 
for  the  generalization,  compared  to  the  rectangular  case,  is  the  follow¬ 
ing. 

1.  A a  increase  in  the  program  length  and  the  execution  time  of 
the  generation  phase  due  to  the  decomposition  procedure  and 
the  computation  of  mapping  vectors  1,  3,  l'  and  3'. 

2.  N  more  real  memory  locations  for  the  mapping  vectors. 

As  shown  in  Figures  4.1  and  4.3,  the  generation  phase  is  merely 
a  preparation  step  which  computes  the  data  necessary  for  the  execution 
phase  and  stores  them  in  the  memory  for  later  use.  Thus  the  major  cost 
for  the  generalization  is  N  more  real  memory  locations.  For  reference, 
the  execution  times  of  the  program  are  listed  in  Table  4.1  for  various 
periodicity  matrices  with  different  sizes.  As  expected,  there  is  little 
difference  in  execution  time  of  the  execution  step  between  the  sequences 
with  non-diagonal  periodicity  matrices  and  their  diagonally  periodic 
counterpart,  while  slight  differences  exist  for  the  generation  step. 


3-22$ 
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Table  4.1.  (a)  Generation  Tine  and  Execution  Tine  for  Two- 

Dlnensional  Hexagonaily  Periodic  Sequence. 

(b)  Generation  Tine  and  Execution  Tine  for  Two- 

Dimensional  Rectangularly  Periodic  Counterparts. 


Periodicity  Matrix 
<N) 

No.  of  Points 
(N) 

Generation  Tine 
(sec) 

Execution  Time 
(sec) 

(a) 

2 

1  " 

1 

L 

2J 

3 

0.091 

0.043 

”  4 

2  " 

-2 

4 

12 

0.113 

0.101 

8 

4  * 

4 

8 

48 

0.201 

0.353 

16 

8  ' 

8 

16 

192 

0.545 

1.344 

32 

16  * 

16 

32 

768 

2.082 

5.630 

3 


12 


0.57  0.40 


0.070  0.098 
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0.122 


0.348 


Table  4.1.  (Continued) 


Periodicity  Matrix 

hi.  of  Points 

Generation  Time 

Execution  Time 

00 

00 

(sec) 

(sec) 

(b)  (continued) 

"  8 

0  " 

192 

0.316 

1.336 

0 

24 

*16 

0  * 

768 

1.226 

3.612 

0 
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CHAPTER  V 


CONCLUSIONS 

It  has  been  shown  In  this  paper  that  any  multidimensional  sequence 
with  arbitrary  periodicity  matrices  can  be  changed  into  rectangularly 
periodic  sequences  in  a  systematic  manner  by  decomposing  the  periodicity 
matrices,  and  this  method  can  be  used  with  proper  existing  algorithms  for 
rectangularly  periodic  sequences  to  evaluate  the  general  multidimensional 
circular  convolutions  and  DFTs.  The  computer  program  for  the  evaluation 
of  IFTs  using  this  method  appears  to  be  similar  in  its  execution  time  to 
the  one  for  the  rectangular  case.  A  listing  of  the  programs  is  given  in 
the  Appendix  along  with  some  important  flowcharts. 

The  method  introduced  in  this  paper  converts  a  general  multi¬ 
dimensional  periodic  sequence  into  a  rectangularly  periodic  sequence, 
then  applies  a  one-dimensional  algorithm  which  depends  on  the  diagonalized 
version  of  the  periodicity  matrix.  It  is,  however,  hard  tc  decide  from 
the  original  periodicity  matrix,  without  decomposing  the  periodicity 
matrix,  which  algorithm  fits  best.  For  example,  in  evaluating  the  DFTs 
for  a  two-dimensional  periodic  sequence  whose  fundamental  period  is 
composed  of  12  points,  the  possible  two-dimensional  rectangularly  periodic 
counterparts  would  be  (2  x  6)  and  (3  x  4),  each  of  which  leads  to  dif¬ 
ferent  algorithms.  It  becomes  even  more  complicated  for  higher  dimensions 
and  larger  sizes.  Hence,  it  would  be  very  convenient  if  a  simple  method 
Is  found  to  predict  the  best  rectangularly  periodic  counterpart. 
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In  soae  algorithms  such  as  the  WFTA,  a  one-dimensional  sequence 
Is  napped  into  a  higher  dimension  to  give  better  efficiency  when  the  size 
is  the  product  of  relatively  prime  numbers.  This  idea  nay  be  directly 
applied  to  the  multidimensional  case.  For  example,  the  periodicity 


matrices  *]  ^ 


10 

20 


can  be  factored  into  relatively  prime 


matrices  such  that 


"  4 

*1 

“2 
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For  such  sequences,  if  a  direct  multidimensional  mapping  scheme 
is  developed,  such  as  is  used  in  the  one-dimensional  case,  possibly  more 
efficient  algorithms  can  be  found.  For  this  particular  example  given 
above,  the  decomposition  procedure  can  be  omitted  if  a  short  DFT  algorithm 
is  developed  for  ^  •  which  can  be,  I  assume,  a  slight  modification 

of  a  3-point  DFT  algorithm.  The  development  of  efficient  short  DFT 
algorithms  and  the  generalization  of  the  WFTA  and  the  prime  factor 
algorithm  are  suggestions  for  further  research. 
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APPENDIX 

PROCRAM  LIST  AND  FLOWCHARTS 


THIS  PROCRAM  IS  PROVIDED  TO  COMPUTE  THE  DFTS  OR  THE  IDFTS 
(INVERSE  DFTS)  FOR  MULTIDIMENSIONAL  SEQUENCES  WITH  ANY  PERIO¬ 
DICAL  EXTENSION.  THIS  PROCRAM  IS  DIVIDED  INTO  TWO  PHASES: THE 
CENERATION  PHASE  AND  THE  EXECUTION  PHASE.  IN  THE  GENERATION 
PHASE,  PERIODICITY  MATRIX  IS  DECOMPOSED  AND  EVERY  ELEMENT  IS 
COMPUTED  WHICH  IS  NECESSARY  FOR  REORDERING  THE  INPUT  AND  OUTPUT 
SEQUENCES.  AND  THE  COEFFICIENTS  ARE  ALSO  COMPUTED  IN  THIS  PHASE 
IN  THE  EXECUTION  PHASE  DFT  OR  IDFT  IS  COMPUTED.  THE  ALGORITHM 
USED  IN  THIS  PHASE  IS  STRICTLY  BASED  ON  "WFTA".  THE  DETAILED 
DESCRIPTION  OF  EACH  ROUTINE  WILL  BE  SUBMITTED  IN  EACH  SUBROUTINE 
AND  IN  THE  PROCRAM  IF  NECESSARY.  THE  DESCRIPTION  OF  EACH  ARRAY 
AND  VARIABLE  IS  AS  FOLLOWED. 

A;  PERIODICITY  MATRIX 

L’;  NEW  BASE  FOR  INPUT  DATA 

V;  NEW  BASE  FOR  OUTPUT  DATA 

N;  DIAGONAL  ELEMENT  OF  DIAGONALIZED  VERSION  OF  A 
OLDN;  REGION  OF  SUPPORT 
F;  FACTORS  TO  BE  USED  FOR  SHORT  DFT 
NF; RELATIVELY  PRIME  FACTORS  OF  N 

NM;  NO. OF  MULTIPLICATION’S  NEEDED  FOR  DFT  OF  SIZE  F 

FXUM;  NO. OF  FACTORS  OF  EACH  N 

P;  PERMUTATION  MATRIX 

NUM;  NO. OF  POINTS 

DM;  DIMENSION 

IRVEC1 ;  INPUT  REORDERING  VECTOR 

ORVECl;  OUTPUT  REORDERING  VECTOR 

REVEC2;  TRANSIENT  STORACE  FOR  IRVECI  6  ORVECl 

MUL;  NUMBER  OF  MULTIPLICATIONS 

INV;  IF  INV-O.  COMPUTE  DFT 

IF  INV- I,  COMPUTE  INVERSE  DFT 


THIS  IS  THE  EXECUTION  PHASE: 

DECOMPOSE  THE  PERIODICITY  MATRIX  "A"  SUCH  THAT  "UAV-D".  AND 
EVALUATE  THE  RELATIVELY  PRIME  FACTORS, IF  ANY,  OF  EACH  DIAGONAL 
ELEMENT  OF  "D".  THEN  EVALUATE  THE  PERMUTATION  MATRIX  BY  WHICH 
THE  INPUT  SEQUENCE  IS  ORDERED  PROPERLY  BY  USINC  GOOD'S  METHOD. 
COMPUTE  THE  COEFFICIENTS, TOO. 


non  o  n 
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PARAMETER  DM-2,NUM-3,INV-0 

INTEGER  A(DM,DM>,l'(DM,DM),V(DM.DM),S(5),OLDN(5).F(9),NUM 
INTEGER  NM( 16) , FNUK(DM) ,P( 5, 100) ,L(9 ) ,NF(DM,4) ,D(9) , INV.DM 
INTECER  IRVECI ( 1000) ,ORVECl( 1000) ,RSVEC2( 1000) 

REAL  C1,C2.C3,S1,S2,S3,PI 
COMPLEX  M( 16,18) ,CMP,CM( 1000) 

DATA  L/9* 1/ 

DATA  D/9*l/ 

DATA  F/16,9,8,7,5,4,3,2,I/ 

DATA  NM/ 1,2, 3, 4. 6, 0,9, 3, 11, 0,0, 0.0, 0,0, 18/ 


OPEN  7." DATA 1" 

READ  FREE(7)((A(I,J),J-l,DM).I«! ,DM) 
READ  FREE(7) (OLDS( I ) , 1-1 , 5) 

CLOSE  7 


ASSIGN'  EACH  COEFFICIENT  A  VALUE  FOR  SHORT  DFT 


DO  30  1-1,16 
DO  30  J-l  ,18 
30  M(  I  ,.!)*(  1 .  ,0. ) 

PI-ACOS(-l.) 

51- SIN(2.*PI/7.) 

52- SIN(4.*PI/7.) 

53- SIS(6.*PI/7 . ) 

Cl-COS(2.*PI/7.) 

C2-COS(4.*PI/7.) 

C3-COS(6.*PI/7.) 

M(2 , 1 )»( 1 . ,0. ) 

M(2,2)-( 1 . ,0. ) 

M(3, 1)«( 1 . ,0. ) 

M(3,2)-CMPLX(COS(2.*PI/3.)-l.,0.) 

M(3,3)-CMPLX(0.  ,SIN'(2.*PI/3.)) 

M(4 , 1 )-( 1 , ,0. ) 

M(4,2)-(l.,0.) 

M(4,3)-(l.,0.) 

M(4,4)-(0..1.) 

M(5, 1 )-( 1 . ,0. ) 

M(5, 2)-CMPLX( (COS(2.*PI/5. )+C0S(4.*PI/5. ))/2.-l.,0.) 
M(5,3)-CMPLX(0. ,(SIN(2.*PI/S.)-SIN(4.*PI/5.))) 
M(5,4)-CMPLX((COS(2.*PI/5.)-COS(4.*PI/5.))/2.,0.) 
M(5,5)*CMPLX(0. ,(SIN(2.*PI/5.)+SIN(4.*PI/5.))) 
M(5,6)-CMPLX(0.  ,SI:J<4.«PI/5.)> 

M(7, 1 )-( 1 . ,0. ) 

M(7 ,2)«CMPLX(  (Cl+C2+C3)/3.-l.  ,0. ) 

M(7 ,3)-CMPLX( (Cl+C2-2.*C3)/3. ,0. ) 


o  n  no  n  o  o  n  o  no  o 
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M(7,4)-CMPLX(0.  ,(Sl-2.*S2-S3)/3.) 

M(7,5)-CMPLX((Cl-2.*C2+C3)/3.,0.) 

M(7,6)-CMPLX<0..(Sl>S2+2.*S3)/3.) 

M<7,7)-CMPLX<0.,<2.*Sl-S2+S3>/3.) 

M(7,8)-CMPLX(0.,(Sl+S2-S3)/3.) 

M(7,9)-CMPLX((2.*Cl-C2-C3)/3.,0.) 

M(8, !)-(!., 0.) 

M(8,2)-<1.,0.) 

M(8,3)-(l.,0.) 

M(8,4)-<0.,1.) 

M(8,5)-(l.,0.) 

M(8,6)-CMPLX(0.,SlN(PI/4.)) 

M(8.7)-(0.,l.) 

M(8,8)*CMPLX(COS(PI/4. ) ,0.) 

M( 16, 1 )»( 1 . ,0. ) 

M( 16,2)»( 1 .  ,0. ) 

M(16,3)-<1.,0.) 

M(lb.4)-(0.,1.) 

M(16,5)-(l.,0.) 

M(16,6)-CMPLX(0.,SIN(PI/4.)) 

M(16,7)-(0..1.) 

M(I6.8)-CMPLX(C0S(PI/4.),0.) 

M(16,9)-(l.,0.) 

M(16,10)«CMPLX(0.,(SIN(PI/8.)-SIN(3.*PI/8.))) 
M( 16. 11)“CMPLX(0. ,SIX(PI/4. ) ) 

M(  16, 12)-CMPLX( (COS(3.*PI/8.)-COS( PI / 3. ) ) ,0.) 
M(16,13)»(0. ,1.) 

M(  16, 14)-CMPLX(0. .(SINCPI/8. )+SlN( 3,*PX/8. ) ) ) 
M( 16, 15)«CMPLX(C0S( PI/4. ) ,0. ) 

M(  16, 16)»CMPLX((COS(PI/8.  )+COS(3.*PI/8. )  )  ,0, ) 
M(  1 6, I 7)«CMPLX(0.  ,SIS(3.*PI/8.)) 

M(  1 6 , 1 8)-CMPLX(C0S(  3.  *PI/8. )  ,0. ) 


DECOMPOSE  THE  PERIOOICY  MATRIX,  AND  GET  THE  MATRICES  MU" ,"VM , AND 
"N'\  THE  DIACONAL  ELEMENTS  OF  THE  DIACOSALIZED  VERSION  THE 


CALL  DECOMP(A,U,V,N,DM) 

EVALUATE  THE  RELATIVELY  PRIME  FACTORS  OF  EACH  DIAGONAL  ELEMENT 
ARRAY  "N'*(NF) ,  AND  THE  NUMBER  OF  FACTORS(FNUM) . 


CALL  FACTOR(N,F,DM,NF,FNUM) 


j-a3b 


! 


EVALUATE  THE  PERMUTATION  MATRIX 


CALL  IPRMT(N,NF,FNUM,DM,P) 


COMPUTE  THE  MULTIPLICANTS  TO  BE  USED 


K-l 

DO  10  I-l.DM 
DO  10  J-l  ,FNUM(I) 

L(K)-SF(I,FNUM(I)-J+1) 

D(K)«SM( L(K) ) 

K-K+l 

CONTINUE 

K-l 

DO  20  11-1,0(1) 

DO  20  12-1, D(2) 

DO  20  13-1, DO) 

DO  20  14-1,0(4) 

DO  20  15-1,0(5) 

DO  20  15-1 ,D(6) 

DO  20  17-1,0(7) 

DO  20  18-1,0(8) 

DO  20  19-1, D(9) 

CMP-M(L(  1 )  ,11  )*M(L(2)  ,I2)*M(L(3) ,  I3)*M(L(4) ,  I4)*M(L( 5)  ,15) 

CMP-CMP*M(L(5),I6)*M(L(7),I7)*M(L(8),I8)*M(L(9),I9) 

CM(K)-CMP 

K-K+l 

CONTINUE 

MU L- K-l 


COMPUTE  THE  INPUT  AND  OUTPUT  REORDERING  VECTORS 


IF(INV.EQ.1)G0  TO  40 

CALL  VECTOR( OLON ,N,U,DM,P,NUM,IRVEC1 ,0,0,1) 
CALL  VECTOR(OLDN,H, V, DM, P.NUM.ORVF.Cl  ,1,0,1) 
CALL  VECTOR! OLON, N, U, DM ,P,NUM,REVEC2, 0,0, 2) 
GO  TO  41 

CALL  VECTOR! OLON, N,V, DM ,P,NUM,IRVEC1 ,0,1,1) 
CALL  VECTOR!  OLDN ,  N ,  U ,  DM ,  P ,  NUM ,  0 RVEC 1 , 1 , 1 , 1 ) 
CALL  VECT0R(0LDN, N,U, DM, P, NUM, REVEC2 ,0,0,2) 
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>b 


< 


C 

At  DO  50  I-I.NUM 

I RVEC1  ( I  )-REV£C2( IRVEC  1(D) 
0  RVEC 1 ( X ) -REVEC2 ( ORVEC 1 (I ) ) 
50  CONTINUE 

C 


OPEN  62,"RVECTOr 

WRITE  FREE(62)MUL 
WRITE  FREE(62)(IRVEC1(I),I-1 .SUM) 

WRITE  FRE£(62)(0RVEC1(I),I-1,NUM) 

WRITE  FR£E(62)(CM(I) ,1-1 ,MUL) 

WRITE  FR£E(62)(L(I), 1-1,9) 

WRITE  FREE(62)(D(I), 1-1,9) 

WRITE  FREE(62)(FNUM( I ) , 1-1 .SUM) 

WRITE  FREE(62)(NM(I), 1-1,16) 

CLOSE  62 

C  THIS  IS  THE  ESO  OF  THE  GENERATION  PHASE 
STOP 
END 


C 

C 

C  »M»**«**«*»*«<*»»*««**»00*M*»M*M«M».***M*M****.»*i 

C  * 

C  *  THIS  IS  THE  EXECUTION  PHASE: 

C  *  MAP  THE  INPUT  SEQUENCE  ON  THE  SEW  COORDINATES  ON  WHICH  THE 
C  *  SEQUENCE  IS  RECTANCULARLY  PERIODIC.  AND  COMPUTE  THE  DFT  OR 
C  *  THE  IDFT.  THEN  MAP  IT  BACK  ON  THE  ORIGINAL  COORDINATES. 

C  *  INPUT  SEQUENCE  IS  READ  ROWWISE. 

C  * 

C  A***************************************************************' 

c 


c 

c 

c 

c 

c 


PARAMETER  DM-2.NUM-12.INV-0 
COMPLEX  TS( 1000) ,CM( 1000) 

INTEGER  L(9) ,D(9) ,MUL,IRVEC1( 1000) .ORVEC I ( 1000) ,ISV 
INTEGER  FNUM(DM) ,NM( 16) 


COMPUTE  DFT 


OPEN  62, "DAT A" 

READ  FREE(62)MUL 
READ  FR£E(62)(IRVEC1(I) ,1-1 ,NUM) 
READ  FREE( 62 )( ORVEC 1( I) , 1-1 ,NUM) 
READ  FREE(6»)(CM( I) , I-I ,MUL) 

READ  FREE(62)(L(I) , 1-1,9) 

READ  FR£E(62)(D(I), 1-1,9) 

READ  FREE(62)(FNUM( I) , I- l ,NUM) 
READ  FREE(62)(NM(I), 1-1,16) 

CLOSE  62 


OOOOU  O  U  U  O  <J  oouoo 
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READ  INPUT  DATA  IN  PROPER  ORDER 


OPEN  17,”DFr’ 

READ  FREE(17)(TS(1RVEC1(I)),I-1,NUM) 
CLOSE  17 


COMPUTE  DFT  OR  I OFT  DEPENDING  ON  INV 


CALL  COMPUTERS, L.FNUM.NM. CM, DM, NUM.D.INV) 


PUT  OUTPUT  DATA  IN  PROPER  ORDER 


OPEN  U/'IDFT" 

WRITE  FREE( 1 1 ) (TSC0RVEC1 < I)), 1-1 ,NUM) 
CLOSE  1 1 


C 

STOP 

END 


£-2*59 
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C  * 

C  * 

C  * 

C  * 

C  * 

C  * 

C  * 

C  * 

C  * 

c  * 
c  * 
c  * 
c  * 
c  * 
c  * 
c  * 

SUBROUTINE  DECOMP(A,UK,VK,N,M) 

INTECER  M.STEP.CHEK 

INTEGER  A<M,M),W(5.5),P(5,5),Q(5.5),U(5,5).N(5) 

INTECER  V{  5. 5)  ,UK(M,M)  ,  VK(M.M)  ,PB(  5,  5)  .UK  5,5)  ,VJ(  5.5) 

C 

C 

C  INITIALIZING  STEP 

C 

C 

IF(M.EQ.5)GO  TO  60 
DO  61  I-M+1,5 
61  N(I)-1 

60  DO  10  I-l.M 

DO  10  J-l.M 
10  W(I,J)-A(I,J) 

CALL  IDENT(UI,M,5) 

CALL  IDENT( VJ.M.5) 

C 

C 

C  DO  THE  DECOMPOSITION 

C 

C 

DO  100  IST-l.M-1 
800  CALL  IDENTCPB.M.5) 

CALL  IDENT(P,M,5) 

700  STEP-0 

CKEK-0 
C 
c 

C  GET  THE  SMALLEST  NON-ZERO  ELEMENT  IN  ABSOLUTE  VALUE  IN  W(I,J) 
C  POR  I.J-IST, . . . ,M. 

C 

C 


THIS  SUBROUTINE  DECOMPOSES  THE  MATRIX  A  IN  SUCH  A  MANNER 
THAT  (UK)(A)(VK)-D.  WHERE  D  IS  A  DIAGONAL  MATRIX. 

A(I,J);  A  MATRIX  TO  BE  DECOMPOSED 
UK(I.J);  RESULTING  PREMULTIPLICATION  MATRIX 
UI(I,J);  TEMPORARY  PREMULTIPLICATION  MATRIX 
VK(I,J);  RESULTING  POSTMULTIPLICATION  MATRIX 
VI(I.J);  TEMPORARY  POSTMULTIPLICATION  MATRIX 
N(I);  DIAGONAL  ELEMENT  OF  D  (-W(I.I)) 

P(l.J);  ROW  PERMUTATION  MATRIX 
Q(I,J);  COLUMN  PERMUTATION  MATRIX 
W( I.J);  GETS  A(I.J)  AND  ENDS  UP  WITH  D(I,J) 
U(I.J).VCI.J);  TEMPORARY  SUBTRACTION  MATRICES 
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CALL  50RT(V,M,IST, IROW, IC0L.5) 

C 

C 

C  MOVE  THE  SMALLEST  TO  THE  ROW  1ST  AND  COLUMN  1ST.  AND  GET  THE 
C  CORRESPONDING  PERMUTATION  MATRICES  P  AND  Q. 

C 

c 

IF( IST.NE. IROW)GO  TO  20 
CALL  IDENT(P,M,5) 

CO  TO  30 

20  CALL  TRANS(P,M,IST,IR0W,5) 

CALL  MULT(P,PB,M,0,5) 

30  IF( IST.NE. ICOL)GO  TO  40 

CALL  IDENT(Q,M,5) 

GO  TO  50 

40  CALL  TRANS(Q,M,IST,ICOL, 5) 

50  CALL  ML'LT(PB,W,M,0,5) 

CALL  MULT(W,Q,M,I,5) 

C 

C 

c  make  all  THE  ENTRIES  OF  W(I,J)  zeros  for  j-ist+i,...,m,  and 
C  I-IST.  AND  GET  THE  CORRESPONDING  SUBTRACTION  MATRIX  V.  THEN 
C  UPDATE  W(I,J)  AND  VJ(I.J). 

C 

C 

900  JCHEK-0 

DO  200  JST-IST+l.M 
IF(U(IST,JST).EQ.O)CO  TO  200 
JCHEK-1 
CO  TO  300 
200  CONTINUE 

300  IF(JCHEK.EQ.O.AND.CHEK.EQ. 1 )G0  TO  400 

IF(STEP.NE.O)CO  TO  700 
CALL  SUBS(V,W,MtISTt 1,5) 

CALL  MULT(W,V,M,1 ,5) 

CALL  MULT(Q.VtM,0,5) 

CALL  MULT(VJ,V,M,I,5) 

STEP-1 
CHEK-1 
CO  TO  900 
C 
C 

C  MAKE  ALL  THE  ENTRIES  OF  W(I,J)  ZEROS  FOR  I-IST+1 , . . . ,M,  AND 
C  J-IST.  AND  GET  THE  CORRESPONDING  SUBTRACTION  MATRIX  U.  THEN 
C  UPDATE  W(I,J)  AND  UI(I,J). 

C 

C 

400  STEP-0 
CHEK-0 


a 
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KCHEK-0 

DO  500  KST-IST+t ,M 

IF(V(KST.IST).EQ.O)GO  TO  500 

KCHEK-1 

CO  TO  600 

CONTINUE 

IF(KCHEK.EQ.O. AND.CHEK.EQ. I )C0  TO  100 
IF(STEP.NE.O)CO  TO  800 
CALL  SUBS(U,W,M,IST,0,5) 

CALL  MULT(U,rf,M,0,5) 

CALL  MULT(U,PB,M,1,5) 

CALL  MULT(U,UI,M,0,5) 

STEP- l 
CHEK-1 
CO  TO  401 
CONTINUE 


MAKE  ALL  THE  DIAGONAL  ELEMENTS  OF  W(I,J)  POSITIVE  AND  PUT  THEM 
IN  N(I),  AND  CHANGE  VJ(I,J)  ACCORDINGLY. 


DO  11  1-1 ,M 

IF(W(I,I).CT.O)CO  TO  12 
N(I)— U(I.I) 

DO  13  J-l.M 
VJ(J,I)— VJ(J,I) 

GO  TO  11 
N(I)-W(I.I) 

CONTINUE 


PUT  THE  RESULTING  UI(I,J)  AND  INTO  UK(I,J)  AND  VK(I,J), 


DO  80  I-1,M 

DO  80  J-l.M 

UK(I,J)-UId,J) 

VK(I,J)-VJ(ItJ) 

CONTINUE 

RETURN 

END 


ouuuu 


*  >;  > .  '  •  y,f; \  w  v  ", • 


rf  *->v  v 
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THIS  SUBROUTINE  MAKES  AN  IDENTITY  MATRIX. 


SUBROUTINE  IDENT(P,M,N) 
INTECER  P(N.N) 

DO  10  I-l.M 
DO  10  J-i.M 
IF( I.EQ.J)CO  TO  100 
PCI,J)-0 
CO  TO  10 
100  P(I,J)-1 

10  CONTINUE 

RETURN 
END 


C  ******************************************************************* 

C  *  * 

C  *  THIS  SUBROUTINE  FINDS  THE  SUBTRACTION  MATRIX  CORRESPONDING  * 
C  *  TO  W(I,J),  AND  PUTS  RESULT  IN  S(I,J).  * 

C  *  * 

C  A********************************************** ******************** 

SUBROUTINE  SUBS(S,W,M,IST,KK,N) 

INTEGER  S(N,N),W(N,N) 

C 

c 

C  INITIALIZE  S(I,J)  TO  IDENTITY  MATRIX 

C 

C 

CALL  IDENT(S.M.N) 

C 

C 

C  FIND  THE  VALUES  FOR  OTHER  ENTRIES  OF  S(I,J) 

C 

c 

IF(KK.NE.O)GO  TO  100 
,  DO  20  I-IST+l.M 

20  S(I,IST)-IFIX(-W(I,IST)/W(IST,IST)) 

CO  TO  200 

100  DO  30  J-IST+I.M 

30  S(IST,J)-IFIX(-W(IST,JVW<IST,IST>) 

200  RETURN 

END 


J?- 


c  . . * . . 

c  * 

C  *  THIS  SUBROUTINE  FINDS  THE  LOCATION  OF  THE  SMALLEST  NON-ZERO 
C  *  ELEMENT  IN  ABSOLUTE  VALUE  IN  MATRIX  A(I,J>,  AND  PUTS  RESULT  IN 
C  *  IROW  FOR  THE  ROW  LOCATION  AND  ICOL  FOR  THE  COLUMN  LOCATION. 

C  * 

C  **************************** *************************** **********, 

SUBROUTI NE  SORT( A , M , I  FT. I  ROW , ICOL , N ) 

INTECER  A(N,N) 

MIN- 10000 
DO  10  I-IST.M 
DO  10  J-IST.M 

IF( f  ABS(A( I , ; ) ) .CT.MIN.OR.A{ I , J) .EQ.O)GO  TO  10 
MIN-A(I.J) 

I  ROW- 1 
ICOL-J 

10  CONTINUE 

RETURN 
END 


C  * 
C  * 
C  * 


*********************** 

THIS  SUBROUTINE  PERFORMS  THE  MULTIPLICATION  OF  TWO  MATRICES, 
A(I,J)  AND  B(I,J),  AND  PUT  THE  RESULT  IN  Afl.J), 


C  ***********************************»**«****»***,****,************** 

SUBROUTINE  MULT(A,B ,M,L,S) 

INTEGER  A(N,N),B(N,N),C(10,10) 

DO  10  I-l.M 
DO  10  J-l  ,M 
C(I,J)-0 
DO  20  K-l  ,M 

20  ca,J)-C(I,J)+A(I,K)*B(K,J) 

10  CONTINUE 

IF(L.NE.O)CO  TO  100 
DO  30  I-l.M 
DO  30  J-l  ,M 
30  B(I,J)-C(I,J) 

30  TO  200 

100  DO  40  1-1 ,M 

DO  40  J-l.M 
40  A(I,J)-C(I,J) 

200  RETURN 

END 


C  *  THIS  SUBROUTINE  FINDS  A  PERMUTATION  MATRIX  CORRESPONDING  * 
C  *  IPT  AMD  JPT,  AND  PUTS  THE  SESULT  IS  P(I,J).  * 

C  *  * 

c  ft****************************************************************** 

SUBROUTINE  TRANS( P , M , I PT, JPT. N ) 

INTECER  P(S,N) 

DO  10  I-l.M 
DO  10  J-I.M 
IF{ I.NE.J )C0  TO  20 
P(I.J)-1 
'  GO  TO  10 
20  P(I.J)-0 

10  CONTINUE 

P(IPT,JPT)«1 
P(IPT,IPT)-0 
P( JPT, IPT)“1 
P( JPT, JPT)“0 
RETURN 
END 


n  n  a  n  n  *■  **  n  n  n  r>  r> 
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c  * 

C  *  THIS  SUBROUTINE  FINDS  THE  RELATIVELY  PRIME  FACTORS  OF  EACH 
C  *  M(I). 

C  *  F(I);  FACTORSC2 , 3,4 , 5, 7,8,9 , 16) 

C  *  NF(I,J);  THE  FACTORS  OF  M(I) 

C  *  FNUM(I);  THE  NUMBER  OF  THE  FACTORS  OF  M(I) 

C  * 

C  *******************************************«******************** 

SUBROUTINE  FACTOR(M,F,DM,NF,FNUM) 

ISTECER  DM 

INTEGER  A(5),F(9),NF(DM,4), FNUM(DM) , IFC ,M( 5) 

REAL  RFC 


INITIALIZE  A( I )  AND  SF(I,J)  | 


DO  40  I-l.DM 
0  ACI)-M(I) 

DO  41  I-l.DM 
DO  41  J-l,4 
1  NF(I.J)-I 


FIND  THE  FACTORS  AND  THE  NUMBER  OF  THE  FACTORS  OF  EACH  A(I). 


DO  100  I-l.DM 
FNUM(I)-1 

30  D0200J-1.9 

IF(A(I).EQ.F(J))CO  TO  10 
200  CONTINUE 

DO  300  K-l ,8 

RFC-REAL(A( I) )/REAL(F(K) ) 
IFC-A(I)/F(K) 

IF( (IFC-RFC) .EQ.O. )C0  TO  20 
300  CONTINUE 

20  NF( I,FNUM< I))-F(K) 

A(  D-IFC 

FNUM( I )-FNUM( I )+l 
CO  TO  30 

10  NFC  I ,FNUM( I) )-F(J) 

100  CONTINUE 

RETURN 
END 


c  * 

C  *  THIS  SUBROUTINE  COMPUTES  PERMUTATION  VECTORS. 

C  * 

C  *  P(I,J)  ;  ARRAY  FOR  PERMUTATION  VECTORS 

C  * 

C  **************************************************************** 

C 

SUBROUTINE  IPRMT(N,NF ,FNUM,DM .P) 

INTECER  T(4),N(5),NF(DM,4),C(4),FNUM(DM),DM,P(5,100).NT(4) 
C 
C 

C  INITIALIZE  PARAMETERS 

C 

C 

DO  IOO  1-1 .4 

C(I)-1 

T(I)-0 

100  CONTINUE 

DO  101  1-1,5 
DO  101  J-I.100 

101  P(I,J)-0 
C 

C 

C  COMPUTE  PERMUTATION  VECTORS 

C 

C 

J-l 

DO  700  I-l.DM 
DO  10  K«l  ,4 

10  NT(K)-N(I)/NF(I,K) 

DO  20  K-l.FNUM(I) 

20  C(K)-NF(I,K) 

C 

C 

DO  300  14-1, C(4) 

DO  400  13-1 ,C(3) 

DO  500  12-1 ,C(2) 

DO  600  I1-1,C(1) 

61  IF(T< 1 ).LE.N( I) )G0  TO  60 

T( 1 )-T( 1 )-N(I) 

CO  TO  61 

60  P(  I,J)-T(  1 ) 

T(l)-T(l)+NT(l) 

J-J+l 

600  CONTINUE 

T( 1 )-T(2)+NT(2) 

T(2)-T(l) 

500  CONTINUE 


T(1)-T(3)+ST(3) 

T(2)-T<1) 

T(3)-T(l) 

CONTINUE 

T(l)-T(4)+NT(4) 

T(2)-TU) 

T(3)-T(l) 

T(4)«T(l) 

CONTINUE 

DO  30  K-1,4 

T(K)-0 

J-l 

CONTINUE 

RETURN 


oooooo  o  •—  n  o  o  o  o 
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C 

* 

c 

* 

THIS 

SUBROUTINE 

COMPUTES  THE  INPUT  AND  l 

c 

* 

VECTOR, 

OR  THE  INPUT  OR  THE  OUTPUT  MAPPINC 

c 

* 

ON  1ST  AND  10. 

c 

* 

IST-t  ; 

MAPPINC  VECTOR 

c 

* 

*2  ; 

REORDERING  VECTOR 

c 

* 

10  -0  ; 

INPUT  MAPPING  VECTOR 

c 

ft 

1  ; 

OUTPUT  MAPPINC  VECTOR 

c 

ft 

INV-0  i 

DFT 

c 

ft 

1  ; 

INVERSE  DFT 

c 

ft 

SUBROUTINE  VECTOR(ON,NN,V,M,P,NUM,REVEC .IO.IN'V.IST) 
INTECER  0N( 5) ,NN( 5) ,N( 5) ,L( 5) , REVEC( 1000) ,P(5,!00),U(5,5) 
INTECER  T(  5, 5)  ,V(M,M)  ,  10,  IN’V,  1ST, ICHEK.Nl  ,N2,N3  ,N4 


INITIALIZE  VARIABLES 


DO  13  1-1,5 
IF(I.GT.M)GO  TO  12 
N(I)-NN(I) 

Ld)-ON(I) 

CO  TO  13 
N(I)-1 

l(i)-i 

CONTINUE 

Nl-N(l) 

N2-N1*N(2) 

N3-N2*N(3) 

N4-N3*N(4) 

IF(IST.EQ.1)C0  TO  21 


COMPUTE  A  REORDERING  VECTOR 


K-l 

DO  20  Jl-1 ,N( 1 ) 

DO  20  J2-1 ,N(2) 

DO  20  J3-1 ,N( 3) 

DO  20  J4-1 ,N(4) 

DO  20  J5-1.NC5) 

J-P(5,J5)*N4+P(4,J4)*N3+P(3,J3)*N2+P(2,J2)*Nl+Pd,Jl)+l 

R£VEC(J)-K 
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K-K+l 

20  CONTINUE 
RETURN 

C 

C 

C  TRANSPOSE  THE  INPUT  MATRIX,  U,  IF  NEEDED. 

C 

C 

21  ICHEK-IEOR(IO.INV) 

DO  14  1-1,5 

DO  14  J«l,5 

IFC I.CT.M.OR.J .GT.M)CO  TO  9 
IF( ICHEK.EQ.O)CO  TO  8 
U(I.J)-V(J,I) 

CO  TO  14 

8  U(I,J)»V(I,J) 

CO  TO  14 

9  U(I,J)-0 

14  CONTINUE 
C 

C 

C  COMPUTE  THE  INPUT  OR  THE  OUTPUT  MAPPING  VECTOR  DEPENDING  ON  10 

C 

C 

DO  15  1-1,5 
DO  15  J-1,5 
T(I,J)-0 

15  CONTINUE 
C 

K-0 

DO  200  15-1, L( 5) 

DO  300  14-1 ,L( 4) 

DO  400  13-1, L(3) 

DO  500  12-1, L(2) 

DO  600  I1-1,L(1) 

K-K+l 

CALL  M0DUL0(T,N,M) 

REVEC(K)-T( 1 ,5)*N4+T( 1 ,4 )*N3+T( 1 , 3)*N2+T( 1 ,2)*N1+T( I ,  I  )+l 
DO  601  Kl-l.M 

601  T( l ,K1)-T( 1 ,K1)+U(K1 , 1 ) 

600  CONTINUE 

DO  501  K2-1.M 

T( 1 ,K2)-T(2 ,K2)+U(K2, 2) 

501  T(2 ,K2)-T( 1 ,K2) 

500  CONTINUE 

DO  401  K3-1,M 
T(1,K3)-T(3,K3)+U(K3,3) 

T(2,K3)-T( 1 ,K3) 

401  T(3,K3)-T( 1 ,K3) 

400  CONTINUE 


£50 
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00  301  K4«l ,N 
T(1,K4)-T(4,K4)+U(K4,4) 
T(2,K4)-T<1,K4) 
T(3,K4)»T( 1 ,K4) 

301  T(4,K4)»T(  1  ,IC4) 

300  CONTINUE 

00  202  K5“l ,M 
T(1,K5)-T(5,K5HU(K5,5) 
T(2,K5)-T(t,K5> 
T(3,K5)"T(  1  ,K5) 
T(4,K5)-T(1,K5) 

201  T(5.K5)-T(1,K5) 

200  CONTINUE 

C 

RETURN 


SUBROUTINE  MODULO(T.N.DM) 
INTECER  T(5,5).N(5).0M 
DO  100  r-I ,0M 

20  IF(T(l,l).LT.0)CO  TO  40 

30  IF(T<1.I).CE.K(I))G0  TO  50 

GO  TO  100 

40  T(1.I)-T(1,I)+S(I) 

CO  TO  20 

50  T( 1 ,I)«T( 1 tI)-N(I) 

CO  TO  30 

100  CONTINUE 

RETURN 
END 
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c  *****  ******************************************************** 
C  * 

C  *  t'nIS  SUBROUTINE  COMPUTES  DFTS  OR  IDFTS  DEPENDING  ON  INV. 
C  *  ITS  OPERATION  IS  WITHIN  ONE-DIMENSIONAL  ARRETS. 

C  * 

C  *  TS(N)  ;  ARRAY  FOR  INPUT  AND  OUTPUT  DATA 

C  *  ST(N)  ;  TEMPORARY  STORAGE  DURING  COMPUTATION 

C  *  CM(N)  :  ARRAY  FOR  COEFFICIENTS 

C  *  S(N)  ;  ARRAY  FOR  INPUT  AND  OUTPUT  ADDITIONS 

C  *  INV  -  0  ;  COMPUTE  DFTS 

C  *  -  1  ;  COMPUTE  IDFTS 

C  * 

C  ****«*****«*************************************************** 

C 

SUB  ROUTINE  COMPUTE  (  TS .  L .  .'SUM ,  NM ,  CM ,  DM .  SUM .  D .  INV  ) 

INTECER  SM(16),FNVM(DM),L(9),DM,LB<9>,D(9),LP.ITEST 
COMPLEX  TS( 1000) ,CM( 1000) ,ST( 1000) ,S(20) 

INTEGER  Ml ,M2,K3,MS2,NUM,INV 

INTECER  LP1.LP2.LS11LS:,L1.L2.KI,K2,MT.MP1,MP2.MS1 


COMPUTE  THE  TOTAL  NUMBER  OF  FACTORS 


LP-0 

DO  90  I-! .DM 
LP-LP+F.VUMU) 
0  CONTINUE 


* 

*  THIS  ROUTINE  PERFORMS  INPUT  ADDITIONS. 

* 


DO  91  I-l.LP 
K2-1 


UPDATE  PARAMETERS 


LP1-1 

LP2-1 

LSI-1 

LS2-1 

DO  80  K-I.LP 
80  LP1-LPI*L(K) 
LP2-LP1/L(I) 


3-253 


LS1-LP1 

LS2-LP2*DfI> 

IF<I.EQ.t)CO  TO  82 
DO  81  K-l.I-I 

81  LP1-LP1*D(K) 

C 

C 

C  DO  INPUT  ADDITIONS 

C 

C 

82  DO  92  K1-1.LP1.LS1 
K3-K2 

DO  93  J-K1.K1+LP2-1 
Ll-J 

DO  94  J1»1,L(I) 

K-l+2 

H1-M0D(K,2) 

IF(M1 .EQ.O)CO  TO  10 

S(J1)-TS(L1) 

CO  TO  11 

10  S(Jl)-STUI) 

11  L1-L1+LP2 

94  CONTINUE 
C 

CALL  CADDO(S,L,I> 

C 

L2-K2 

DO  95  J2-1,D(I) 

IF(M1.EQ.0)G0  TO  20 
ST(L2)-S(J2) 

CO  TO  21 

20  TS(L2)»S(J2) 

21  L2-L2+LP2 

95  CONTINUE 
K2-K2+1 

93  CONTINUE 

K2-K3+LS2 
92  CONTINUE 

91  CONTINUE 

C 

c  ft*************************************************** 

c  * 

C  *  THIS  ROUTINE  PERFORMS  COMPLEX  MULTIPLICATIONS. 

C  * 

c  ******************«*******************«************* 

MT-1 

DO  40  I-l.LP 
MT-MT*D(I) 

IF(Mt.EQ.t)CO  TO  50 


< 


40 
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DO  60  X-l.MT 

I 

1  60 

TS(I)«TS<I)*CM(I) 

CO  TO  71 

I 

1  50 

DO  70  I-l.MT 

Eg 

1  70 

ST(1)-ST(I)*CM(I) 

H 

1  c 

R 

1  c  * 

01 

i  c  * 

* 

ts 

1  c  * 

THIS  ROUTINE  PERFORMS  OUTPUT  ADDITIONS. 

* 

I  c  * 

* 

Q 

1  c  * 

1 

DO  100  I-l.LP 

K2-1 

ITEST-LP-I+1 

1  c 

1 

1  c 

3 

1  c 

UPDATE  PARAMETERS 

g 

1  c 

1  c 

MP2-1 

DO  101  K-1,1 

| 

1  101 

MP2«MP2*L(LP-K+2) 

MS1«MP2*D( ITEST) 

MS2-MP2*L( ITEST) 

MP1-MP2 

DO  102  K-l, ITEST 

1  102 

MPl-!fP!*0(K) 

,1  c 

B 

1  c 

P 

1  c 

DO  OUTPUT  ADDITIONS 

3  c 

| 

1  c 

DO  120  Kl-l.MPl.MSl 

K3-K2 

no  110  J-K1.K1-WP2-1 

Ll-J 

DO  200  Jl-l.D(ITEST) 

K-I+2 

M2-MOD(K,2) 

IF(M1 .EQ.M2)G0  TO  201 

S(J1)-TS(L1) 

CO  TO  202 

3  201 

S(Jl)-STUl) 

[1  202 

L1-L14MP2 

LJ  200 

CONTINUE 

[1  c 

ll  c 

IF(L( ITEST) . £Q . 2 )G0  TO  3001 

CALL  CADDKS.XNV.L, ITEST) 

L2-K2 

DO  300  J2“l ,L(ITEST) 
IF(M1.£Q.M2)C0  TO  301 
ST(L2)-S(J2) 

CO  TO  302 
TS(L2)-S(J2) 

L2-L2+MP2 

CONTINUE 

K2-K2+1 

CONTINUE 

K2-K34MS2 

CONTINUE 

CONTINUE 


IF(M1.EQ.M2)C0  TO  401 
DO  400  I-l.NlW 
TS(I)-ST(I) 

RETURN 

END 
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C  ************************************************************* 

c  * 

C  *  THIS  SUBROUTINE  PERFORMS  ADDING  OF  INPUT  DATA  FOR  SHORT 
C  *  DFTS  BEFORE  THE  MULTIPLICATION  STEP. 

C  • 


SUBROUTINE  CADDO(S,L.I) 

COMPLEX  S(20),T,T1,T2,T3 
INTEGER  L(9) , I 
IF(L(I).EQ.l) RETURN 
IF(L( I) .EQ.2)GO  TO  20 
IF(L(I).EQ.3)CO  TO  30 
IF(L<I).EQ.4)CO  TO  40 
IF(L(I).EQ.5)CO  TO  50 
IF(L(I) .EQ.7)CO  TO  70 
If (L(I).EQ.8)GO  TO  80 
IF(L(I).EQ.16)CO  TO  160 
C  ************************************ 

C  *  * 

C  *  FOT  2-POINT  SHORT  DFT  * 

C  *  * 

C  ************************************ 

20  T-S(l)+S(2) 

S(2)“S( 1 )-S(2) 

S( 1 )“T 
C 

RETURN 

31 

C  ******* ********************* ******** 

c  *  * 

C  *  FOR  3-POINT  SHORT  DFT  * 

C  *  * 

C  ************************************ 

30  T-S(2)+S<3) 

S(3)“S(2)-S(3) 

S( I )"T+S( 1 ) 

S(2)-T 

C 

RETURN 

Q  ************************************ 

C  *  * 

C  *  FOR  4-POINT  SHORT  DFT  * 

C  *  * 

C  ************************************ 
40  T-S(l)+S(3) 

S(3)“S( 1 )-S(3) 

S(l)-T 

T-S(2)+S(4) 

S(4)-S(2)-S(4) 


'^0^0%' 


so 


S(2)-T 

T-S(l)+S(2) 

S(2)-S(l)-S(2) 

S(l)-T 

RETURN 


C  *  FOR  5-POINT  SHORT  DFT  * 

C  *  * 

C  ******************<.***************** 

50  T-S(2)+S(5) 

S(5)-S(2)-S(5) 

S(2)-T 

T-S(4)+S(3) 

S(3)“S(4)-S(3) 

S(4)-T 

T-S(2)+S{4) 

S(4)-S(2)-S(4) 

S(2)-T 

T-S(?)+S(5) 

S(l)-S(l)+S(2) 

S(6)-T 

C 

RETURN 

C  ************************************ 

c  *  * 

C  *  FOR  7-POINT  SHORT  DFT  * 

C  *  * 

C  ************************************ 
70  Tl-S(2)+S(7) 

S(7)-S(2)-S(7) 

S(2)-T1 

Tl-S(5)+S(4) 

S(4)-S(5)-S(4) 

S(5)-T1 

T1*S(3)+S(6) 

S(6)-S(3)-S(6) 

S(3)-T1 

Tl"S(2)+S(5)-*-S(3) 

S(I)-S(1)+T1 

T2-S(2>S(5) 

S(5)-S(5)-S(3) 

S(,)-S(3)-S(2) 

S2)-T1 

Tl*S(7)+S(4)+S(6) 

T3“S(7)-S(4) 

S(4)-S(4)-S(6) 

S<6)-S(6)-S(7) 


<S-J57 


80 


S(2)-T 

T-S(l)+S<2) 

S(2)-S(l)-S(2) 

S( 1 )“T 
C 

RETURN 

C  *  * 

C  *  FOR  5-POINT  SHORT  OFT  * 

C  *  * 

C  ************************************ 

50  T-S(2)+S(5) 

S(5)-SC2)-S(5) 

S(2)-T 

T-S(4)+S(3) 

S(3)"S(4)-S(3) 

S(4)-T 

T-S(2)+S(4) 

S(4)*S(2)-S(4) 

S(2)-T 

T-S(?)+S(5) 

S(l)-S(l)+S(2) 

S(6)“T 

C 

RETURN 

C  ************************************ 

c  *  * 

C  *  FOR  7-POINT  SHORT  OFT  * 

C  *  * 


C  ************************************ 
70  T1«S(2)+S(7) 

S(7)-S(2)-S(7) 

S(2)-TI 

Tl-S<5)+S(4) 

S(4)-S(5)-S(4) 

S(5)-T1 

TI-S(3)+S(6) 

S(6)-S(3)-S(6) 

S(3)-T1 

Tl-S(2)+S(5)*S(3) 

S(1)-S(1)+T1 

T2-S(2)-S<5) 

S(5)-S(5)-S(3) 

S(,)-S(3)-S(2) 

S  2)-Tl 

Tl-S(7)+S(4)+S(6) 

T3-S(7)-S(4) 

S(4)-S(4)-S(6) 

S(6)-S(6)-S(7) 


£~£5rJ 


oo  o  o  o  o  n 
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S<7)-T3 

S(8)-T1 

S<9)-T2 

C 

RETURN 


FOR  8-POINT  SHORT  DFT 


0  T-S(l)+S(5) 

S(5)«S(1)-S(5) 

S(l)-T 

T“S(3)+S(7) 

S(7)-S(3)-S<7) 

S(3)-T 

T-S(2)+S(6) 

S(6)-S(2)-S<6) 

S(2)-T 

T-S(4)+S(8) 

S(8)«S(4)-S(8) 

S(4)-T 

T-S(l)+S(3) 

S(3)-S(l)-S<3) 

S(l)-T 

T-S(2)+S(4) 

S(4)-S(2)-S(4) 

S(2)-T 

T-S(l)+S(2) 

S(2)-S(l)-S(2) 

S(l)-T 

T-S(6)+S<8) 

S(8)-S(6)-S<8) 

S(6)-T 

C 

RETURN 

C  ***********************************, 
C  *  , 

C  *  FOR  16-POINT  SHORT  DFT  ' 
C  *  i 

C  ft********************************** 
160  T-S(l)+S<9) 

S(9)-S(l)-S<9) 

S(l)-T 

T-S(5)+S(13) 

S(13)-S<5)-S(13) 

S(5)-T 

T-S(3)+S<11) 

S(11)«S(3)-S(11) 


3-a5%, 
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S<3)-T 

T-S(7)+S(15) 

S<15)"S(7)-S(15) 

S(7)-T 

T-S(2)+S(J0) 

S( 10)“S(2)-S( 10) 

S(2)-T 

T-S(6)+S<14) 

S(14)-S(6)-S<14) 

S'6)-T 

T«S(4)+S( 12) 

S< 12)"S(4)-S( 12) 

S(4)-T 

T-S(8)+S(16) 

S(16)-S(8)-S(16) 

S(8)-T 

T-S(l)+S(5) 

S(5)-S(l)-S(5) 

S( 1 )“T 

T-S(3)+S(7) 

S(7)-S(3)-S(7) 

S(3)-T 

T-S(2)+S(6) 

S(6)«S(2)-S(6) 

S(2)-T 

T-S(4)+S(8) 

S(8)“S(4)-S(8) 

S(4)-T 

T-S(l)+S(3) 

S(3)-S(l)-S(3) 

S<l)-T 

T-S<2)+S(4) 

S(4)-S(2)-S(4) 

S(2)-T 

T-S(l)+S(2) 

S(2)-S(l)-S(2) 

S(l)-T 

T-S(6)+S<8) 

S(3)-S(6)-S(8) 

S(6)-T 

T«S(11)+S(15) 

S(15)-S(ll)-S(15) 

S(U)-T 

T-S(10)+S(16) 

S(16)-S(t0)-S(16) 

S(10)-T 

T«S<14)+S(12) 

S( 12)“S( 14)-S( 12) 
S(14)-T 

S(17)-S(10)+S(14) 

S(18)“S(16)+S(12) 


4-2t5<? 


ooooo^o  nonnn 
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RETURN 

END 


C  »*t«MM«*»**M***M*»M»*«*»***«**M*»*t***»**M**«****Mt**l 

c  * 

C  *  THIS  SUBROUTINE  PERFORMS  ADDING  AFTER  THE  MULTIPLICATION 
C  *  STEP  TO  GIVE  THE  RESULT  OF  THE  DFT.  IF  INV-1 .COMPUTES  INVERSE 
C  *  DFT.  AND  IF  INV-O,  COMPUTES  DFT. 

C  * 

c  *****************************************************************, 

SUBROUTINE  CADD1(ST,INV,L,I) 

INTEGER  INV,I,L<9) 

COMPLEX  T,ST(20) ,T1 ,T2,T3 
IF(L(I).EQ.  DRETURN 
IF(L(I).EQ.3)C0  TO  300 
IF(L(I).EQ.4)C0  TO  400 
IF(L(I).EQ.5)C0  TO  500 
IF(L(I).EQ.7)C0  TO  700 
IF(L(I).EQ.8)C0  TO  800 
IF(L(I).EQ.16)G0  TO  600 

C  ************************************ 

c  *  * 

c  *  FOR  3-POINT  SHORT  DFT  * 

C  *  * 

c  ************************************ 

300  T-ST(t)+ST(2) 

ST(2)-ST(3)+T 
ST(3)-T-ST(3) 


IF  IT  IS  FOR  INVERSE  DFT. 


IF(INV.EQ.I)CO  TO  10 
T-ST(3) 

ST(3)-ST(2) 

ST(2)-T 

0  RETURN 


l 


k 

\ 


FOR  4-POINT  SHORT  DFT 


400 


T-ST(3)+ST(4) 
ST(4)-ST(3)-ST(4) 
ST(3)*ST<2) 

ST(2)-T 
C 
C 

C  IF  IT  IS  FOR  INVERSE  DFT, 

C 

C 

IF( INV.EQ. 1)G0  TO  40 
T-STC4) 

ST(4)-ST(2) 

ST(2)-T 

C 

40  RETURN 


C  ************ *********x******i 

c  * 

C  *  FOR  5-POINT  SHORT  DFT 

C  * 

C  ******************* *********1 

500  T-ST(1)+ST<2) 

ST(2)-T+ST(4) 
ST(4)-T-ST(4) 

ST( 5)“STf  5)-ST(6) 
ST(6)-ST(6)+ST(3) 


ST(5)-ST(2)-ST(5) 

ST(2)-T 

ST(3)-ST(4)+ST(6) 

ST(4)-ST(4)-ST(6) 

C 

C 

C  IF  IT  IS  FOR  INVERSE  DFT, 

C 

C 


T-ST(2)+ST(5) 


IF( INV.EQ. I )C0  TO  50 
T-ST(S) 

ST(5)-ST(2) 

ST(2)-T 

T-ST(4) 

ST(4)-ST(3) 

ST(3)-T 

C 

50  RETURN 


n  n  n  n  n  nj  n  ooooo 


C  *  FOR  7-POINT  SHORT  DFT 

C  * 

C  ****************************** 

700  T-ST<1)+ST(2) 

ST(2)-T+ST(9)+ST(5) 
ST(9)"T-ST(9)-ST{ 3) 
ST(5)-T-ST(5)+ST<3) 
T“ST(8)-ST(7)~ST(6) 

ST( 7 >-ST( 8>+ST( 7 )+ST( 4 ) 

ST(4)-ST(S)-ST(4)+ST(6) 

STC3)-ST(9)+T 

ST(6)-ST<9)-T 

T-ST(2)+ST(7) 

ST(7)-ST(2)-ST(7) 

ST(2)-T 

T-ST(5)+ST(4) 

ST(4 )“ST( 5)-ST(  4 ) 

$T(5)-T 


IF  IT  IS  FOR  INVERSE  DFT, 


IF( INV.EQ. I )G0  TO  70 
T-ST(7) 

ST(7)-ST(2) 

ST(2)-T 

T-ST(6) 

ST(6)-ST(3) 

STOW 

T-ST(5) 

ST(5)-ST(4) 

ST(4)-T 

0  RETURN 


FOR  8-POINT  SHORT  DFT 


800  T-ST(3)+ST<4) 

ST( 4 )*ST( 3)-ST<  4 ) 
ST(3)-T 
T-ST(5)+ST<8) 
ST(8)-ST(5)-ST(8) 


uuuuu  u  »  u  u 
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T-ST(7)+ST(6) 

ST(6)-S«7)-ST(6> 

ST(7)-T 

T-ST(5)+ST(7) 

ST(7)«ST(S)-ST(7) 

ST(5)-T 

T-ST<8)+ST(6) 

ST(6)«ST(8)-ST(6) 

ST(8)-T 

T«ST(2) 

ST(2)-ST<5) 

ST(S)-T 

T-ST(4) 

ST(4)-ST(6) 

ST(o)-ST<8) 

ST(8)-ST(7) 

ST(7)-T 


IF  IT  IS  FOR  INVERSE  OFT, 


IF(INV.EQ. 1 )G0  TO  80 
T-5T(8) 

ST(8)-ST(2; 

ST(2)-T 

T-ST(7) 

ST(7)-ST<3) 

ST(3)-T 

T-ST(6) 

ST(6)«ST(4) 

ST(4)-T 

0  RETURN 


C  *  FOR  ) 6-POINT  SHORT  DFT  * 

C  *  • 

600  T-ST(3)+ST(4) 

ST(4)«ST(3)-ST(4) 

ST(3)-T 

T-ST(S)+ST(6) 

ST(6)-ST(5)-ST(6) 

ST(5)-T 

T-ST(7)+ST(8) 

ST( 8 )"ST( 7 )-ST( 8 ) 

ST(7)-T 

T-ST(5)+ST(7) 
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ST(7)-ST(S)-ST{7) 

ST<5)-T 

T-STf6)+ST(8) 

ST(8)«ST{ 6)-ST( 8) 

ST(6)-T 

T-ST(9)+ST<15) 

ST(15)-ST(9)-ST(15) 

ST(9)-T 

T-ST(l3)+ST(ll) 

ST(n)-sr(i3)-sT(n) 

ST(  1 3)*T 

ST(10)»ST(17)+ST( 10) 

ST(14)-ST(17)-ST(14) 

ST( 16)»ST( 16)-ST( 18) 

ST(12)-ST(l2)-ST(t8) 

T-ST(9)+ST(10) 

ST(9>.T  ST( 10)*St(9)-ST( 10) 

T-ST(15)+ST(t4) 

ST( 14)»ST( 1 5)-ST( 14) 

ST(15)-T 
T“ST( 13)+ST( 14) 

ST( I6)“ST( 13)-ST( 16) 

ST(13)-T 
T»ST( 1 1)+ST( 12) 

ST( 12)«ST( 1 1 )-ST( 12) 

SUID-T 

T-ST(9)+ST(I3) 

ST(  1 3)”ST(9)-ST(  13) 

ST(9)-T 

T“ST( IO)+ST( 16) 

ST(16)-ST(10)-ST(16) 

ST(10)-T 

T-ST(15)+ST(ll) 

ST(ll)-ST(15)-ST(U) 

ST(15)-T 

T-ST(14)+ST(12) 

ST( 12)“ST( |4)-ST( 12) 

ST<14)-T 

T-ST(2) 

ST(2)-ST(9) 

ST(9)-T 

T-ST<3> 

ST(3)-ST(5) 

ST(5)-T 

T-ST(4) 

ST(4)-ST(11) 

ST( 1 7)»ST( 13) 

ST(13)-T 


uo  u  o  u 


ST(18)-ST(8) 

ST(8)-ST(17) 

T-ST(15) 

ST< 15)“ST( 18) 
ST<17)-ST<6) 
ST(6)-T 
ST(ll)-ST(17) 


IF  IT  IS  FOR  INVERSE  OFT, 


IF(INV.EQ. 1 )C0  TO  60 
T-ST(t6) 

ST( 16)«ST(2) 

ST(2)-T 

T-ST(15) 

ST(15)-ST(3) 

ST(3)-T 

T-ST(li) 

ST(I4)-ST(4) 

ST(4)-T 

T-ST(13) 

ST(13)-ST<5) 

ST(5)-T 
T“ST( 12) 

ST(J2)-ST<6) 

ST(6)-T 

T-ST(ll) 

ST(11)-ST(7) 

ST(7)-T 

T-ST(IO) 

ST(10)-ST(8) 

ST(8)-T 

C 

60  RETURN 
END 


2-37«a 


CALL  IDEKT(P.K,N? 


Figure  A. 


P(IFT.JPT)«l 
P  ( I PT  ,  I P? )  =0 
p(JPT,ip?)«: 
F  ( JPT , JPT ) B0 


Q  KETURN  ^ 


6.  Flowchart  of  Subroutine  TKANS. 


Figure  A. 9.  (Continued) 
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Figure  A. 10*  (Continued) 
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ABSTRACT 

Tnls  paper  Is  concerned  with  the  use  of  2*0 
linear  prediction  for  image  segmentation.  It 
begins  with  a  brief  summary  of  the  matnemat iss 
involved  in  2-0  linear  predictive  analysis  of 
arbitrarily-shaped  regions.  Tnen,  it  introduces 
a  2-0  iPC  distance  measure  based  on  the  error 
residual  of  2-0  linear  prediction.  Finally,  it 
describes  how  the  above  results  can  be  applied  to 
image  segmentation  using  a  simple  cluster  seeking 
algorithm.  The  results  indicate  that 
arbitrarily-shaped  Image  regions  can  be  well 
identified  and  clustered  using  as  features  their 
2-0  U>C  parameters. 


INTRODUCTION 

One-dimensional  linear  prediction  has  been 
successfully  used  by  Itakura  [1]  and  Others  for 
attracting  speech  parameters  and  for  deriving  a 
IPC  distance  measure  in  speech  classification  and 
recognition.  However,  it  appears  that  there  lies 
been  no  similar  approach  In  pictorial  feature 
eatractlon  and  in  Image  segmentation  by  cluster¬ 
ing  [2,3].  Hence,  it  Is  the  purpose  of  this 
paper  to  introduce  the  use  of  2-0  linear  predic¬ 
tion  and  the  resulting  IPC  distance  for  image 
segmentation.  Because  features  in  images  typi¬ 
cally  are  irregularly  shaped,  we  begin  by  first 
formulating  the  problem  of  estimating  tne  optimal 
IPC  parameters  for  an  arbitrarily-shaped  image 
segment.  Such  a  segment  may  be  simply  or  multi¬ 
ply-connected. 


LINEAR  PREDICTION 

let  *(m,n)  represent  a  2-0 
spat  tally-discrete  array  of  intensity  image 
samples.  According  to  the  autoregressive  image 
model  introduced  in  [a]  for  use  in  predictive 
Image  coding. 


Ie(k,t)«'m-k,n-t)*a0*t(".n)  (1) 


He  can  view  the  2-0  prediction  error  sequence 
e(m,n)  together  with  the  coefficients 
(a{k,  i),  a.)  as  an  alternative  etact 
character tiation  of  the  image  signal  a(r,n).  The 
bias  coefficient  eQ  accounts  for  the  ract  that 
the  intensity  image  sample:  are  explicitly  biased 
Since  they  are  always  nonnegative.  Tne 

set  {a(k ,R  >,*0}  can  be  seen  as  a  set  of  features 
containing  information  about  the  specific  image 
segment. 

Suppose  that  x(m,n)  has  support  on  the  re¬ 
gion  o  in  the  (m.n)-plane.  Inside  R  we  identify 
several  hompgeneout  regions  D  ,v»),...,L  as  illu¬ 
strated  in  Fig.  1.  Tne  general  linear  prediction 
problem  is  to  find  a  set  of  optimal 
coefficients  (a(k,t),  a-)  which  minimile  a 
mean-squared  error 

E-I  l  e*(«»«)  (2) 

•  n 

where  e(m,n)  is  defined  by  Eq.  (1).  Tne  array 
a(fc,l)  is  shown  In  Fig.  2  to  possess  a  rectangu¬ 
lar  region  of  support  which  tn  the  general  case 
may  Include  any  other  desired  shape.  The  total 
number  of  prediction  coefficients  is  P»(Uy 
lh*l)*(R»-Ri*l)-l,  and  the  number  of  our  unknowns 
Is  P*l.  He  can  distinguish  two  cases  depending 
on  whether  the  region  is  simply  ('."l)  or 
multiply-connected  (l  >  1): 

a)  One  simply-connected  reoion  0u 

He  overcome  the  fact  that  0  has  an  irregu¬ 
lar  shape  by  considering  a  one-dimensional 
ordering  of  the  greater  rectangular  region  0  of 
the  (*,n)-plane;  i.e..  If  u  is  an  N  »N  region, 
then  a  rowwise  ordering  would  be 
0(m,n)wef|vnvlrJ.  This  ordering  maps  every  pair 
(m,n),  such  as  0  am,  n  <N-1,  onto  an  integer  J 
belonging  to,  the  ordered  set 
2-  •  (1,2,3, ... ,h‘).  If  the  information  about 
the  rowwise  scanning  of  0  is  available,  then  0(-) 
is  a  reversible  mapping  of  the  region  c  onto  the 
set  2.  ,  and  we  can  recover  (m,n)  from  J.  how 


(•)  This  work  was  supported1  by  the  Joint  Services 
Electronics  Program  under  Contrast 
f 0RAG29-81 -K-0024 . 
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the  regions  0  «r«  defined -by  the  sets  of  Inte- 
gert  Zv'U.tjf.  4*1.* . «n.  v*>.2 . <•  “h«r« 

1,  (m,n)*0',(j)  c  0 

*W(4M  <J) 

0.  otherwise 

If  we  think  Of  2  „  tnd  I  as  N2-dimens1ona1 
vectors,  the  r-th  nifniero  element  (Jr)  of  their 
component -wise  mult tpl teat  Ion  rill  give  vs  the 
r-th  pile  (r.,n)  from  the  H  pairs  which  make  up 
the  region  0  .  Thus,  re  can  consider  a 

one-d'menslon«T  Indexing  for  the  region  Dy  : 

r.ISv(.,n).ISvtC*1(jr)],  r-1,2 . Hy  (4) 


method  the  matrla  C  It  a  syxmetrte  block-ToeplItt 
matrix  and  Is  always.  positive-definite,  because 
then  R(k,  »:1,J)  equals  R(|k-t|,  |t-J|)  or 

«(|k-1|.  -U-4I). 

b)  Hultlple  disjoint  regions  D  «  v«1.2....t 

The  problem  here  It  to  obtain  a  set  of 
cocnon  coefficients  (t(k,t),  i.)  «ntch  mlnlmlte 
the  error  E  over  all  the  regions  D  ,  v«1,2,..,L 
simultaneously-  It  can  be  easily  inom  that  the 
optimal  coefficients  are  the  solution  to  the 
fol lorlng  system 

II  CJ  •  •  •  I  r  (10) 

*•1  v*l 


The  Initials  IS  mean  "Indexing*  for  the  ‘signal* 
x(m,n).  hew,  the  restriction  of  *(m,n)  or  Its 
translate  «i»-k,n-  I )  to  0  can  be  thought  of  as 
a  My-d1mens1ona1  vector:  v 

*,*( Sq(r):  Sq(r)-«(m-k,n-t),  r*lSy(m,n)]T(S) 

Where  4*lP(k,t),  and  1P(0,0)«0  It  understood. 
The  Indexings  15(*)t  1P(* )  need  not  be  the 
same.  At  tnis  point  re  can  express  the  2-0  cor¬ 
relation  lags  (4)  as  Inner-products  of  known 
vectors: 


J)*  <*„.*_>  <*> 

where  QtwIP(k.l).  q»*IP(1,J).  Similarly,  the  2-0 
shift  lags  S(k.i)  [4]  are  equal  to  the  sum  of  the 
components  of  the  vector  Sq,  q«!P(k.t). 

The  optimal  coefficients  which  minimise  the 
squared  error  £  over  the  region  0  are  the  solu¬ 
tion  to  a  system  jf  normal  equations: 

C..T  (7) 

where  C  Is  a  (P*1)x(P*l)  matrix  whose  entries  are 
equal  either  to  R(k,  i:i,j)  or  to  S(k,  s). 

•*la{10°(U) . sJIP'VjJ.  e0]T  (8) 

. <*„.*, >.1<0,0))T  (9) 

All  the  above  analysis  refers  to  the 
covariance  method  (4)  which  minimises  £  only  over 
the  region  0v  .  Alternatively,  we  ceuld  modify 
our  approach '’to  Include  also  the  autocorrelation 
method,  which  assumes  that  x(m,n)  Is  sero  out¬ 
side  0  and  ntnlmtaet  £  over  the  entire 
(m.n)-plsne.  In  the  covariance  method  the  awtrlx 
C  Is  tymnet* le  and  positive-definite,  except  for 
degenerative  cases  idiere  it  Is 
posltlve-semldeflnlte.  In  the  autocorrelation 


where  C  .  r  are  the  correlation  matrix  and  cor¬ 
relation  vector  of  the  region  0  .  The  approach 
to  obtain  the  correlation  and  I'm  ft  lags  Is  al¬ 
most  the  same  as  In  part  (a).  The  only 
difference  Is  that  in  order  to  fine  the  indexing 
IS(m,n)  for  the  ensemble  of  all  the  regions,  one 
has  to  multiply  the  vector  2-  by  the  sum  of  all 
the  vectors  2  defined  In  (3jf.  however.  If  one 
has  already  precomputed  C  and  r  ,  It  Is  easier 
simply  to  add  them  component-wise. 


2-D  IK  DISTANCE 


Let  us  consider  the  augmented  coefficient 
vector  b*[l,  -  w]T  and  the  augmented  correlation 
matrix 


A 


R(0, 0:0,0)  I 
r  ! 


(ID 


where  R(0,0:0,0)  Is  obviously  the  energy  of 
x(m,n)  over  the  analysis  region.  The  matrix  A 
may  refer  to  a  simply-connected  region  or  to 
disjoint  regions.  It  can  be  proven  that  the 
squared  error  E  can  be  expressed  at  the 
posttive-(seml)  definite  quadratic  form 

C  •  bTA  b  (12) 


Having  reduced  the  problem  to  our  one-dlmenttonal 
one  by  using  the  one-dlmens tonal  Indexing  for  the 
arrays  a(k,  t)  and  x(m,n)  over  the  regions  of 
Interest,  we  could  use  a  2-D  LPC  distance  similar 
to  the  one  used  by  Itakura  (1)  In  the  1-0  case. 
Thus,  over  an  analysis  region  possessing  augment¬ 
ed  correlation  matrix  A,  we  define  the  distance 
between  two  sets  (  «j,  m^)  of  coefficients  as 

dA(w,,  m^-llog  (m,TAw1/w2TAm2)|  (13) 


from  (13)  It  Is  Inferred  that  the  above  distance 
Is  a  semi-metric.  In  the  sense  that  It  satisfies 
all  the  properties  of  a  metric  except  one;  t.e., 
data, ,  m,)*0  does  not  Imply  that  o.*m.  .  Also, 
It  4  ft  dear  that  tntf  1  distance 


relates  a,  and  a,  only  indirectly  through  th« 
matrix  A. 1 


CLUSTERING  ALGORITHM 

let  us  suppose  nou  thit  we  «rc  given  o  2*0 
Image  Otto  orrty  x(m,n)  defined  over  e  greeter 
region  fi  .  The  storting  point  of  our  elgorltha 
for  tesge  segaentetton  by  clustering  Is  to  divide 
the  entire  laege  Into  N.  satlTer  disjoint  regions 
which  consist  of  more  or  less  homogeneous 
pictorial  texture.  This  homogeneity  Hill  be 
hopefully  reflected  in  a  statlonarlty  of  the 
prediction  coefficients  over  one  region  and  a 
similarity  between  coefficients  of  disjoint  re* 
glons  xith  similar  texture.  Then,  we  obtain  the 
augmented  correlation  matrices  A  for  each  image 
subregion.  This  way,  each  analysis  region  can  be 
thought  of  as  a  pattern  whose  features  are  the 
entries  of  the  matrix  A.  From  Eqs.  (7)  and  (11) 
it  is  clear  that  the  optimal  LPC  coefficients  can 
be  obtained  from  the  matrix  A.  Having  obtained 
the  LPC  characterization  of  each  region  (pattern) 
one  could  use  any  clustering  algorithm  which 
employs  a  distance  measure.  He  have  used  a  vari¬ 
ation  of  the  so-called  K-means  (5j  clustering 
algorithm  modified  to  use  the  LPC  distance  mea¬ 
sure.  Our  approach  it  suimarlzed  below: 

Step-1:  Select  K  Initial  cluster  centers  (re¬ 
gions)  Cj,  j-l,2,...,K.  The  selection  may  be 
either  arbitrary,  or  automatic  using  a  max-ain 
algorithm  [Sj  which  finds  the  K  LPC  patterns 
which  are  farthest  apart. 

Step-2:  Allocate  each  of  the  N.  LPC  patterns 
(characterized  by  their  correlation  matrices  At 
and/or  by  their  optimal  coefficients  a.  j  to  one 
of  the  K  cluster  centers  according  to: 

belongs  to  cluster  j  if 


dA/V  ej)<dA,t'f  CJ*  . * 

for  all  1,  1*1,2,...,Hb.  Ties  are  solved  arbi¬ 
trarily.  ¥ 

Step-3:  Update  the  cluster  centers:  Having 
found  from  step-2  that  each  cluster  consists  of 
Hi  LPC  patterns,  we  find  a  set  of  prototype  co¬ 
efficients  for  each  cluster  (its  cluster  center) 
by  using  linear  predictive  analysis  of  multiple 
disjoint  regions  (10);  i.e.,  for  each  cluster  j 
we  sum  up  the  Hi  correlation  matrices  and  vectors 
and  solve  (10). 

Step-4:  The  algorithm  terminates  whenever  the 
cluster  centers  do  not  change  from  the  previous 
iteration.  Otherwise,  go  back  to  step-2  and 
Iterate  again. 

The  above  clustering  algorithm  is  an  un- 
supervised  pattern  recognition  scheme.  He  have 
found  that  it  always  converges  in  about  3-10 
iterations.  A  good  choice  of  the  initial  cluster 
centers  may  affect  considerably  the  speed  of 
convergence.  The  performance  of  this  clustering 


algorithm  obviously  depends  ipon  the  method  used 
to  extract  the  LPC  parameters.  Thus,  if  we  use 
the  same  number  of  prediction  coefficients  and 
the  same  prediction  mask  (Fig.  2),  the  covariance 
and  the  autocorrelation  method  yield  similar 
results.  However,  the  correlation  matrix  A  in 
the  autocorrelation  method  has  much  fewer 
different  entries  because  of  its  block-ToeplItt 
property.  For  Instance,  if  P-8,  the  matrix  A  has 
only  IS  different  entries  compared  to 
tPrl)(P*2)/2-55  for  the  covariance  method.  The 
site  of  the  analysis  regions  does  net  play  an 
Important  role  as  long  as  one  stays  well  inside 
homogeneous  regions.  For  regions,  however,  which 
contain  boundaries  between  different  textures, 
smaller  analysis  regions  are  required.  The  shape 
of  the  prediction  mask  (Fig.  2)  was  found  to  be 
of  paramount  importance.  He  tried  3  different 
shapes:  1)  Qj-R^-l  gives  an  all-plane 
symmetric  mask,  2 )  Qi-O,  Q>-2,  Ri»-1,  R^-l  gives 
a  half-plane  mask  ana,  3)  Oi-Rj-O,  Qp-Ry-E  gives 
a  quarter  plane  mask.  All  these  different  masks 
involve  the  tame  number  of  prediction 
coefficients  P-8.  In  terms  of  the  average  nor¬ 
malized  mean-squared  error  E,  the  first  mask  is 
the  best  and  the  third  is  the  worst.  However,  In 
terms  of  clustering  performance  the  third  mask  Is 
the  best  whereas  the  first  is  the  worst.  The 
reason  for  this  might  lie  in  the  fact  that  the 
quarter-plane  mask  Is  the  deepest  in  both 
directions. 


EXPERIMENTAL  RESULTS 

Fig.  3  shows  a  182  >  256  pixels  black  and 
whl'.>  image  which  consists  of  64  ■  64  regions 
wis  ■  different  texture.  He  used  32  »  32  and  16  * 
16  analysis  regions  with  P-8  in  our  clustering 
algorithm,  and  the  results  were  similar  In  both 
cases.  The  8  V  ediction  coefficients  for  each 
analysis  region  were  obtained  by  using  the  auto¬ 
correlation  method  with  the  quarter-plane  3  •  3 
mask.  Fig.  4  shows  the  resulting  clusters  where 
R-3.  The  analysis  regions  were  32  «  32  pixels, 
and  each  region  is  Illustrated  by  a  number  j, 

J*l,2 . K,  corresponding  to  the  number  of  that 

cluster  which  this  region  was  assigned  to.  Simi¬ 
larly,  Fig.  i  shows  results  from  clustering  the 
same  32  *  32  regions  in  K-5  different  clusters. 
From  Fig.  4  and  Fig.  5  we  see  that  the  clustering 
algorithm  on  this  simple  image  yielded  perfect 
results  which  agree  with  our  own  perceptual 
classification  of  the  different  textures  in  the 
Image  of  Flq.  3.  The  above  good  results  were 
obtained  by  using  analysts  regions  which  were 
embedded  well  inside  homogeneous  textures.  If, 
however  the  analysis  regions  contain  more  than 
one  different  textures,  then  one  should  think  of 
reducing  the  size  of  the  analysis  regions  and/or 
employing  other  techniques  to  Isolate  the 
boundaries  between  different  textures. 
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Abnwr-ln  this  paper.  w»  (Row  that  •  one-dimeiirijwiJ  Of  muliidi- 
mrniMnil  u  uniquely  tpecified  auger  mild  retlnctioAl  by  Hi 

atgned  Foutiei  Iraniform  magnitude  (magnitude  and  I  bit  of  phue  In* 
foemation).  In  addition,  «*  develop  a  numeric*!  algorithm  to  iteon- 
■truer  a  one-dimertiional  or  multidimeniional  aequence  from  Ha  Fouriet 
tramform  magnitude.  RvconUn-ctiM  example!  obtained  u«ng  tlain 
algorithm  tie  alio  provided. 


I.  Introduction 

IN  a  variety  of  contexts,  such  as  electron  microscopy  [t  1, 
X-uy  crystallography  [2],  optics  13].  and  Fourier  transform 
signal  coding  |4) .  it  is  desirable  to  reconstruct  a  sequence  fiom 
partial  Fourier  domain  information.  As  a  consequence,  con¬ 
siderable  attention  has  been  paid  to  this  area,  and  some  signifi¬ 
cant  results  have  been  developed.  It  has  been  previously  estab¬ 
lished  [5]-(7]  that  under  very  mild  restrictions  a  Finite  extent 
one-dimensional  (l-D)  or  multidimensional  (MD)  sequence  it 
uniquely  specified  to  within  a  scale  factor  by  the  tangent  of  its 
Fourier  transform  (FT)  phase,  and  algorithms  for  implement¬ 
ing  the  reconstruction  have  been  developed.  It  is  well  known 
that,  in  contrast,  the  FT  magnitude  does  not  uniquely  specify 
a  l-D  sequence.  Fot  MD  sequences,  the  FT  magnitude  specifies 
a  sequence  to  within  a  translation,  sign,  and  a  central  symme¬ 
try  |7],  |8J,  and  reconstruction  algorithms  developed  so  fat 
have  been  successful  [7|  for  only  a  very  restricted  class  of  MD 
sequences. 

From  the  above  results,  on  the  question  of  unique  specifica¬ 
tion  of  a  sequence,  there  appear  to  be  significant  differences 
between  l-D  and  MD  sequences,  and  between  the  tangent  of 
the  FT  phase  and  the  FT  magnitude.  In  addition,  the  tangent 
of  the  phase  and  the  magnitude  of  a  complex  number,  which 
have  been  considered  in  previous  studies,  do  not  completely 
specify  the  complex  number.  In  this  paper,  we  show  that  if 
the  signed  FT  magnitude  (magnitude  and  one  bit  of  phase  in¬ 
formation)  is  considered  rather  than  the  FT  magnitude,  there 
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arc  only  minor  differences  on  the  question  of  unique  specifi- 
cation  of  k  sequence,  between  l-D  and  MD  sequences,  and 
between  the  tangent  of  the  FT  phase  and  the  signed  FT 
magnitude.  In  particular,  it  is  shown  that  under  very  mild 
restrictions,  the  signed  FT  magnitude  is  sufficient  to  uniquely 
specify  *  l-D  or  MD  sequence.  Wc  note  that  the  tangent  of 
the  phase  and  the  signed  magnitude  of  a  complex  number 
completely  specify  the  complex  number. 

In  Section  II  of  this  paper,  the  basic  theory  is  presented.  In 
Section  III  in  algorithm  fot  impleme>-.mg  the  reconstruction 
is  discussed,  and  Section  IV  illustrates  several  examples. 

II.  Theory 

In  this  section,  we  discuss  the  unique  specification  of  a  se¬ 
quence  by  its  FT  magnitude  and  1  bit  of  phase.  We  initially 
consider  the  one-dimensional  (l-D)  case  and  then  extend  the 
l-D  te>ult  to  the  multidimensional  (MD)  case.  Before  we  pre¬ 
sent  the  theoretical  results,  we  define  the  notat.-nn  that  will  be 
used  thtoughout  the  papet. 

Let  .r  (it)  denote  a  l-D  sequence  which  is  causal  and  finite  ex¬ 
tent  so  that  xfn)  is  zero  outside 0<n<L  -  I .  Furthermore, 
we  restrict  x(n)  to  be  real-valued.  Let  Jf(r)  and  X(u)  repre¬ 
sent  the  7  ttansform  and  Fourier  transform  of  xfn),  so  that 

*(*)*! ;' *(»>*■"  (») 

11*0 

*<«) -  *(.-)  x(n)e-*".  (2) 

The  Fouriet  transform  3f(u)can  be  represented  in  terms  of  its 
real  part  Xr{ui)  and  imaginary  part  Xt(u),  ot  in  terms  of  its 
magnitude  |JF(w)l  and  phase  M<*>)  **  follows: 

*(w)  ■  (w)  +  /*,(«)  •  |Ar(w)le"'(w>.  (3) 

To  ensure  that  9,(u)  is  well  defined  at  all  w.we  assume  that 
X{t)  has  no  zeros  on  the  unit  circle.  The  phase  function  6,(u>) 
in  (3)  represents  the  principal  value  of  the  phase  so  that 

-■<«,(w)<».  (4> 

The  1-bit  FT  phase  information  will  b;  represented  by  the 
function  S“(w)  defined  as 

ft)  o  -  tr  <  <  a 

\  - 1  otherwise 
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Fig.  1.  Mapping  of  the  I -til  phase  function. 


rig.  2.  Fourier  transform  magnitude,  phase.  J-bit  phase,  and  signed 
magnitude  of  the  sequence  Xt:i  *  I  ♦  3a'1  ♦  5a'  *  2a*  . 

where  a  is  I  known  constant  in  the  range  of  0  <  a  <  ».  Thus, 
the  complex  plane  is  divided  into  two  regions  separated  by  a 
straight  line  passing  through  the  origin  and  at  an  angle  a  with 
the  real  axis,  as  shown  in  Fig.  I .  For  example,  for  o  *  w/2, 
Sj,J(w)  represents  the  algebraic  sign  of  Rel-T(cj)}  More  gen¬ 
erally,  Sf(u>)  is  the  algebraic  sign  of  Re{e/<’/1'<,,^(w)}.  The 
algebraic  sign  of  zero  is  assumed  to  be  positive. 

The  function  Gj(w)  is  defined  as 

<£(cu)*S;(<u)|X(u)l  (6) 

and  will  be  referred  to  as  the  signed  Fourier  transform  magni¬ 
tude  since  it  contains  both  magnitude  and  sign  information. 
An  example  of  |A'(cj)I.  d„(fcj),  -Sj(cj),  and  Cf( w)  when  o  ■ 
»/2and  X(z)*  I  -viz"1  ♦5z'1  +  2Z*1  is  shown  in  Fig.  2. 

Final'y,  given  a  positive  integer  iV,  we  define  a  constant  P 
and  an  interval  K  as 

P  *  ~Y~  *n<*  &  *  (0.  *)  for  Af  odd 

8«  —  and  R  *  (0, »]  for  N even.  (7) 

The  uniqueness  of  a  l-D  sequence  when  the  signed  Fourier 
transform  magnitude  G?(cu)  is  specified  is  based  on  the  fol¬ 
lowing  statements.  The  proof  of  these  statements  is  given  in 
the  Appendix. 


Statement  Al :  Let  x(n)  and  y(n)  be  two  real,  causal,  and 
finite  extent  sequences.  If  |Jf(to)|  •  |)r'(w)|.  x(n)  and  >(n) 
can  always  be  expressed  as 

*(n)  -  b(«)  *  o(«) 


y(n)  « tb(n)  •  -  I  -  n) 

where  «  ■>  ♦!  or  - 1  and  a(n)  and  b(n)  are  real,  causal,  and  fi¬ 
nite  extent  sequences  with  N  corresponding  to  the  length  of 
e(n),  i.e„  a (n)  »  0  outside  0  <  *  <  Af  -  I . 

Statement  A2:  Let  b(n)  be  a  real,  causal,  and  finite  extent 
sequence.  For  any  positive  integer  Af,  the  equation 

is  satisfied  for  at  least  P  distinct  values  of  u>  in  the  interval  R, 
where  P  and  R  are  as  defined  in  (7). 

Statement  A3:  Let  a(n)  be  a  real  sequence  which  is  zero 
outside  0  <  n<  N  -  I.  If  the  equation 

Im{A(z)z<A'',>|  Jw}=0 

l»f 

is  satisfied  for  at  least  P  distinct  values  of  u  in  the  interval  R, 
then  it  is  identically  equal  to  zero  and  e(n)  =  «( N-  I  -  n). 

We  use  the  above  three  statements,  whose  proofs  are  shown 
in  the  Appendix,  to  demonstrate  the  following  theorem: 

Theorem  1 :  Let  x(n)  and  v(n)  be  two  real,  causal,  and  finite 
extent  sequences  with  z  transforms  which  have  no  zeros  on  the 
unit  circle.  If  G*/!(u >)  =  G’n(u) for  all  <*i,  then  x(n) «>•(«). 

To  show  Theorem  1 ,  we  note  from  (5)  and  (6)  that  the  con¬ 
dition  G!n(u)B  G’,!(u)is  equivalent  to 

sign{2r*(u>)]  MT(w)|  -  sign{ »-,(«)}  |  T(«)l  (8) 

which  in  turn  implies  that  |AT(u)| «  |  y(u)|,  and  therefore  that 
sign  {**(<*>))  =  sig  n{rR(u)).  (9) 

From  Statement  Al ,  then,  x(n)  and  y(n)  can  be  expressed  as 
ar(if)  »  b(n)  *  e(n) 

>(*)■»  tfc(«)*«(Ai’-  1  -  »)  (10) 

where  f  *  *1 .  Fourier  transforming  (10)  we  obtain 
X(w)‘A(u)B(u) 

y(w)-ee'M>-,»A(-w)«(w).  (II) 

To  show  that  e  *  I  in  ( 1 1 ).  we  evaluate  (9)  at  cj  *  0  and  recog¬ 
nize  that  XR  (0) »  A(0)  B(0)  and  Fj,(0)  •=  *<4(0)  8(0),  so  that 

sign{A(0)  8(0))  «  sign{eA(0)  8(0)).  (12) 

Since  Jf(w)is  not  zero  at  0,(1 2)  requires  that  e  *  +1 . 

Since  e  *  I ,  from  (10),  showing  that  x(n)  *  y(n)  is  equivalent 
to  showing  that  c(n)  •  a(N  -  I  -  it).  Toward  this  end,  we  con¬ 
sider  the  f-Tt 

jr„(w) !  y*(u). 

From  (I  • )  with  e  »  i .  it  can  be  shown  that 

**(<*>)♦  yJt(w)-2Re(4(w)e'w«v*,>'»]. 

•Re|8(«)e-fw‘w'»VJ).  (13) 
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From  Statement  A2.  there  »re  it  least  P  distinct  viluet  of  u  in 
the  interval  ft  which  we  denote  uu,,  l  *  1 . 2.  •  • .  P  for  which 

Rel«u»,)e .0.  1-1.2 (14) 

From  (13)  and  (14), 

XRiu, )♦  y*(w<)*0.  /-  1,2,'  ••  (15) 

From  (9).  both  term*  of  the  left-hand  tide  of  (15)  have  the 
tame  tign  for  all  w.  Since  a  turn  of  two  terms  having  the  tame 
tign  can  be  zero  only  when  both  terms  are  aero,  we  have 

Xk(l>,)=Y, t(u,)-0 
and  therefore  also, 

XR(cjt) -  >>(W|) *  0.  < -  1, 2,  •  •  • . r, (16) 

From  (1 1)  and  the  fact  that  e  •  ),  it  can  be  shown  that  (16) 
can  be  expressed  as 

*„(w,)-  '2  Iml^)^^-"'1) 

•lm|fl(w,)e-'w'(lV-,>'1]  =0. 

I-  1,2.  •••  .f.u.e/J.  (17) 

Since  B(u)  is  not  aero  for  any  <j.  it  follows  from  (14)  that  the 
second  factor  in  (17)  satisfies  the  property 

lmffl(u,)e'/w'<JV',,/,l*0.  r=1.2,  -./,.w,€*.  (18) 

From  (17)  and  (18), 

ImM(to()e,wftv',>/,J  *0,  i  =  1, 2, •  •  •  .F, W|€fJ.  (19) 

From  (19)  and  Statement  A3.a(n)  =  a(N  -  1  -  n)  so  that  x(rt) 
=>(»),  thus  demonstrating  Theorem  1. 

The  result  in  Theorem  I  can  be  generalized  in  various  ways. 
Specifically,  in  Theorem  I,  we  have  assumed  that  o  =  tr/2, 
which  is  a  specific  representation  of  the  1-bit  phase  informa¬ 
tion.  It  can  be  shown  that  the  statement  is  true  for  other 
choices  of  0  <  o  <  tr.  When  a  =  it  so  that  S’  (to)  *  sign[0,(cj)| , 
a  sequence  is  uniquely  specified  by  O’(u-)  when  x(0)  =  0. 
Theorem  1  can  also  be  extended  to  anitcausal  (left-sided)  se¬ 
quences.  The  proofs  of  these  extensions  can  be  found  in  (9J. 
When  the  above  extensions  are  incorporated  in  Theorem  I ,  we 
have  the  following  general  theorem: 

Theorem  2:  Let  x(n)  and  >■(«)  be  two  real,  causal  (or  anti- 
causal),  and  finite  extent  sequences,  with  r  transforms  which 
have  no  zeros  on  the  unit  circle.  If  G®(<j)  -  G“(w)  for  all  u 
and  0<a<rr,  then  x(n)»y(n).  When  a»ir.  if  CJ(<o)- 
G*(w)  and  x(0)  «  y( 0) »  0,  then  x(n)  *  y(n) 

Theorems  1  and  2  explicitly  requite  that  the  sequences  be 
real-values  and  causal  (or  anticausal).  The  necessity  cf  these 
conditions  can  be  illustrated  through  counterexamples.  Con¬ 
sider  fust  the  condition  that  the  sequences  be  real,  and  let  y(n) 
equal  e*a'  '*x(n)  where  x(n)  is  real.  In  this  case,  it  is  straight¬ 
forward  to  show  that  C*(u)  *  G"(w).  Since  G'(u)  does  not 
uniqueiy  specify  x(n),  G’(cj)  does  not  uniquely  specify  y(n). 
To  indicate  the  necessity  of  the  causality  (or  anticausality) 
condition,  consider  as  one  counterexample  the  two-sided  se¬ 
quences  x(n)  and  y(n)  for  which  the  i  transforms  are 

X(z)*-e*  *6  -z*1  •(*  *2  -  z*! )(-z  ♦  2  +  z‘*) 

Ffz)**1  ♦  4z  *2  -4z"'  tz"’  »(z  ♦  2-  z'1)1-  (20) 


For  these  two  sequences  it  can  be  easily  shown  that  |Jf(u)t  • 
I  >  (<*»)!  and  in  this  case,  then,  x(n)  and 

yin)  are  different  sequences,  but  they  have  the  tame  signed  FT 
magnitude. 

In  Theoiems  I  and  2,  uniqueness  results  were  presented  as¬ 
suming  that  the  signed  spectral  magnitude  of  a  finite  length  se¬ 
quence  is  known  for  all  frequencies  in  the  interval  (0,  2e).  In 
the  case  of  FT  phase,  it  is  possible  to  generalize  the  uniqueness 
results  to  the  case  in  which  the  FT  phase  is  known  only  for  a 
finite  number  of  distinct  frequencies.  Specifically,  it  has  been 
shown  [6)  that  for  a  finite  length  sequence  of  length  .V  which 
has  no  symmetric  (zero-phase)  factors  in  tts  z  transform,  any 
(Af-I)  samples  of  the  FT  phase  are  sufficient  to  uniquely  de¬ 
fine  the  sequence  lo  within  a  scale  factor.  Therefore,  since  the 
FT  phase  need  not  be  known  for  ai1  cj,  such  a  result  has  been 
useful  [6]  in  the  development  of  practical  algorithms  for  re¬ 
conquering  a  finite  length  sequence  from  its  FT  phase  samples. 
Unfortunately,  however,  a  fixed  finite  set  of  signed  magnitude 
samples  is  not  always  sufficient  to  uniquely  specify  a  teal, 
causal,  and  finite  length  sequence.  For  example,  consider  the 
following  two  causal  sequences  of  length  iV  -  3. 

x(n)=  1 .0  Sin)  ♦  2.6  8(n  -  l)+  1.26(«-  2)  (21) 

y(n)  =  1 .2  6(n)  +  2.6 6(n  -  l)+  1.04(n-  2).  (22) 

Since  ,v(n)  is  obtained  from  x(n)  by  flipping  both  of  the  zeros 
of  Af(z)  about  the  unit  circle,  both  x(n)  and  >■(«)  have  the 
same  spectra)  magnitude.  Furthermore,  in  the  interval  (0,  x) 
the  real  part  of  the  Fourier  transform  of  x(n)  is  equal  to  zero 
at  only  one  frequency,  0.477023a  and  the  real  part  of 
the  Fourier  transform  of  y(n)  is  equal  to  zero  only  at  cj  * 
0.526166ir.  Therefore,  the  signed  magnitude  of  X(u)  is  equal 
to  the  signed  magnitude  of  )'(<*»)  for  all  to  outside  the  intervals 
(0.477023a.  0526166a)  and  (-0.526166a,  -  0.477023a). 
Consequently,  an  arbitrary  number  of  signed  magnitude  sam¬ 
ples  within  this  region  is  not  sufficient  to  distinguish  x(n)  from 
>(")• 

Even  though  a  teal,  causal,  finite  extent  sequence  is  not 
uniquely  specified  by  samples  of  its  signed  FT  magnitude  at  a 
finite  number  of  arbitrary  frequencies,  il  is  specified  by  sam¬ 
ples  of  its  signed  FT  magnitude  at  a  finite  number  of  properly 
chosen  frequencies  which  are  different  for  different  sequence]. 
Specifically,  for  x(it)  which  is  zero  outside  0  <  n  <  N  -  I ,  the 
FT  magnitude  |T(cj)|  is  completely  specified  by  (iV-  1)  dis¬ 
crete  Fourier  transform  (DFT)  samples  In  the  interval  (0,  a). 
The  1  bit  of  FT  phase  SJ(<o)  is  completely  specified  by 
the  positions  of  in  discontinuities  and  by  its  value  at  u  -  0. 
Since  the  function  Sf(ro)  has  at  most  21V  discontinuities  in 
(  a.  »a).  G“(w)  is  completely  specified  by  a  maximum  of  3A' 
samples  at  properly  chosen  frequencies. 

In  the  above  discussion,  we  considered  only  1-D  sequences 
We  now  extent  Theorem  2  to  MD  sequences.  Let  x(»)  denote 
an  MD  sequence  x(fl|,itj,-  •  ■  ,«»<), and  let  C“(-»)  denote  the 
signed  F  T  magnitude  of  x(rr),  where  G?(a»)  repiesents  Gj(u>, , 
Uj ,  •  •  • ,  wjtr)  and  is  given  by  We  define  an  MD 

sequence  x(n)  lo  have  a  one-sided  region  of  support  in  the  Af- 
drmensionaJ  space  /r, ,  », ,  ■  ■  • ,  nw  if  it  has  nonzero  values  for 
only  one  polarity  of  each  n,.  For  example,  for  a  two-dimen¬ 
sional  sequence  there  are  four  possible  regions  of  support 
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which  arc  comb'cnl  with  Die  sequence  being  one  tided,  cor¬ 
responding  to  the  four  qiuidranu.  Theorem  3.  which  follows, 
represents  a  feneraluaiion  of  TVoeem  2  to  encompass  MD 
sequences. 

Theorem  J  tel  a  Or)  snd  rl«)  he  two  real  finite  extent  se¬ 
quences  with  one-sided  support  and  with  i  transforms  which 
have  no  zeros  at  l*tl*  |zil“  ■  •  "*  l*ar  I*  •-  ^ C?(ae)  *  Cj(nr) 
for  alt  m  and  0  <  *  <  a.  then  *<«)  *  ><■)•  When  «  •  a,  if 
C;(a»)  -  C’M  and  *(0)  •  y<0)  *0,  then  rr(»i*  ><«). 

We  demonstrate  the  validity  of  Theorem  3  foe  a  2-D  se- 
quence  wluch  has  the  first-quadrant  support  sire  M,  X  At,  to 
that 

a(/t|.*tj)*3,(«i.'ti)“0ouis*d*0<«i  <*fi  •  I  and 
0<»,  <Alt  -  I. 

The  proof  for  a  higher  dimension  and  for  a  different  quadrant 
support  is  analogous  to  the  2-D  case  with  the  first-quadrant 
support.  To  demonstiate  Theorem  3.  we  map  the  2-D  se¬ 
quences  a<ri| .  «i )  and  r(n, ,  rr > )  into  two  l-D  sequences  x(n) 
and }  (n)  by  the  following  transformation: 

x(n,  •  Afj  ♦ni)«a(n,.n,) 

y(n,  •«,  ♦n,)«y(n,.n,).  (23) 

In  essence,  the  transformation  in  (23)  cortesponds  to  mapping 
a  2-D  sequence  to  a  l-D  sequence  by  concatenating  the  columns 
of  the  2-D  sequence.  Clearly .  i(n)  and  y(n)  given  fcy  (23)  are 
real,  causal,  and  finite  extent  sequences.  From  (23),  it  is  clear 
that  the  transformation  is  invertible.  Furthermore,  it  can  be 
shown  (10]  that 

i(w)-3f(w,.w,)jWi.u.Mj<  Wj.u 

and 

f(w)-K(«,.W: 0j-w.  (24) 

From  (24),  it  follows  that  the  signed  FT  magnitudes  of  £(n) 
and  y(n)  ate  specified  by  the  signed  FT  magnitudes  of  Jt(n, , 
n, )  and  y(n, ,  n2  )•  Therefore,  if  ,  Wj )  *  6'“( o>\ ,  «i ), 

then  Cj(ui)  ■  G “(ur).  In  addition,  since  Xfl\ ,  z2 )  and  >(Z| , 
/, )  have  no  zeros  at  |;,|  «  |zj|  *  I,  from  (24),  X(:)tnd  >'(z) 
have  no  zeros  on  the  unit  circle.  Since  x(n)  and  y(n)  satisfy 
all  the  conditions  in  Theorem  2,  it  follows  from  Theorem  2 
that  i(n)  *  y(n).  Since  the  transformation  (23)  is  invertible, 
*("i  .  «i)  • y(' »i .  »j) *»  requited  by  Theorem  3. 

The  condition  that  AT(vu)  *  0  at  any  w  is  much  more  restric¬ 
tive  for  2-D  sequences  than  for  l-D  sequences,  since  X(t)  •  0 
represents  surfaces  in  the  (ci.Zj)  plane  for  2-D  sequences  and 
points  in  the  z  plane  for  l-D  sequences.  From  the  proof  of 
Theorem  3  described  above,  however,  it  is  not  necessary  to  re¬ 
quire  X(t»)  *  0  at  any  w  We  only  need  to  require  that  Af(») 
v*  0  at  the  Dices  of  «*  needed  to  form  A'(w)  in  (24).  This  is  a 
much  less  restrictive  condition  than  the  condition  in  Theo¬ 
rem  3. 

The  theoretical  result  in  Theorem  3  differs  from  that  by 
Hayes  (S)  In  several  respects.  In  the  result  by  Hayes  (S],  only 
samples  of  the  FT  magnitude  are  required,  but  the  sequence  is 
■estricted  to  have  a  nonfactoririble  z  transform  and  the  unique 
specifrcation  of  the  sequence  is  only  to  within  a  sign,  i  transla¬ 
tion,  and  a  central  symmetry.  In  Theorem  3,  the  signed  FT 


magnitude  is  required,  but  the  sequence  may  have  a  factor 
izablt  z  transform  and  it  uniquely  specified  in  the  strict  sense. 


III.  Algorithm 


In  Section  II,  we  showed  that  under  certain  conditions  a  se¬ 
quence  is  uniquely  specified  by  its  signed  FT  magnitude.  In 
this  section,  we  discuss  an  algorithm  to  implement  the  recon¬ 
struction  of  a  sequence  Jr(it)  from  its  signed  FT  magnitude. 
The  sequence  sr(n)  is  assumed  to  satisfy  the  conditions  of 
Theorem  3.  In  addition,  its  signed  FT  magnitude  C°(w)  is 
assumed  known. 

The  algorithm  that  we  have  developed  is  an  iterative  proce¬ 
dure  which  is  similar  in  style  to  other  iterative  procedures 
studied  by  Gerchbetg-Saxton  (Il|  and  Fienup  |I2|.  In  the 
iterative  algorithm,  the  "time"  domain  constraint  that  x(/i)  is 
real  and  finite  extent  with  a  one-sided  region  of  support,  and 
the  frequency  domain  constraint  thit  the  signed  FT  magnitude 
of  x(n)  is  given  by  G°(<oi.  ate  imposed  separately  in  each 
iteration.  Specifically,  let  Afv(«i)  denote  the  estimate  of  3f(w) 
at  the  pth  iteration.  The  estimate  Af,,(w)  is  inverse  Fourier 
transformed  to  the  time  domain  to  obtain  x'r(n) 

*»•  (2S) 

From  *,(«),  we  generate  an  estimate  x'^fn)  which  satisfies  the 
time  domain  constraints 


*;•(»)* 


R*l*p(»)l 

0 


for  nGA 
for  up  A 


(26) 


where  A  represents  the  known  support  region  of  x(n). 

The  sequence  *£(»)  is  then  Fourier  transformed  back  to  the 
frequency  domain  to  obtain  X'^(u)  as  follows: 

*;<«)*FW(i.)].  (27) 


The  new  frequency  domain  estimate  Xf.,(*t)  is  then  obtained 
by  enforcing  the  constraint  that  ,(»)  ■  C“(cu)as  follows: 


(|Af(«)|e%("» 

|  |3f(«)|e«1•-S^,*), 


if£;-(«)-s;(«) 

ifs;,(«)=-o;<«). 


(28) 


Specifically,  the  correct  magnitude  is  substituted  for  the  esti¬ 
mated  magnitude.  If  Sj-  fui) ■  Sj(au),  then  the  phase  of  the 
estimate  it  retained.  Othetwise,  the  estimate  is  reflected  about 
a  line  that  passes  through  the  origin  with  angle  a  to  correct  the 
sign  of  Sj~(ot).  This  completes  one  iteration.  The  initial  esti¬ 
mate  A'o(w)  wc  have  used  is  given  by 

X0(ea)*  lA’Mle^oW)  (29) 


where  0,  (w)  is  given  by 


M**)' 


[<*’  -  for  $£(«)«♦! 

^  a  ♦  ^  for  S°(**) » - 1. 


(30) 


The  iterative  algorithm  discussed  above  is  illustrated  in  Fig.  2 
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The  asymptotic  behavior  of  the  algorithm  in  Fig.  3  has  not 
yet  been  studied  theoretically.  We  have  observed  experimen¬ 
tally  that  a  stable  estimate  of  the  sequence  to  be  retrieved  is 
always  attained  after  a  large  number  of  iterations. 

To  implement  the  algorithm  in  Fig.  3.  the  Fourier  and  in- 
verse  Fourier  transform  operations  are  approximated  by  dis¬ 
crete  Fourier  transfrom  (DFT)  and  inverse  DFT  (IDFT)  opera¬ 
tions.  Although  the  uniqueness  is  not  guaranteed  in  terms  of 
the  signed  FT  magnitude  samples,  we  have  empirically  ob¬ 
served  that  the  algorithm  reconstructs  the  desired  sequence 
provided  that  the  signed  FT  magnitude  is  densely  sampled  in 
the  frequency  domain,  so  that  the  FT  magnitude  is  completely 
specified  and  the  discontinuities  of  S?(cu)  are  individually  re¬ 
solved  by  the  samples  of  5“(os).  The  FT  magnitude  |3T(w)|  is 
completely  specified  by  samples  of  |A'(<u)t  when  the  DFT 
size  is  twice  the  size  of  the  known  support  of  xfn)  in  each 
dimension. 

IV.  Examples 

The  algorithm  discussed  in  Section  III  has  been  used  to  re¬ 
construct  a  variety  of  different  l-D  and  2-D  sequences  from 
their  signed  FT  magnitudes.  In  this  section,  we  present  some 
of  these  examples. 

Fig.  4  illustrates  one  example  in  which  a  l-D  sequence  is  re- 
constructed  from  its  signed  FT  magnitude.  In  Fig.  4(a)  is 
shown  a  47-point  sequence  obtained  by  sampling  female  speech 
at  a  10  kHz  rate.  In  Fig.  4(b)  is  shown  the  sequence  recon¬ 
structed  by  using  the  iterative  algorithm  with  the  DFT  size  of 
1024  after  SO  iterations.  In  addition  to  the  above  example,  a 
number  of  other  examples  have  been  consideied.  In  all  cases, 
we  observed  that  the  algorithm  reconstructs  the  desired 
sequence. 

Fig.  5  illustrates  sn  example  in  which  a  2-D  sequence  is  re¬ 
constructed  from  its  signed  FT  magnitude.  In  Fig.  S(a)  is 
shown  an  image  of  size  2S6  X  256  pixels.  In  Fig.  5(b)  is  shown 
the  image  reconstructed  by  using  the  iterative  algorithm  using 
the  DFT  size  ofS12X$12  after  10  iterations. 


Fig.  4.  Speech  tcpmeni  sampled  at  47  points.  (•)  Original  sequence, 
(b)  Reconstructed  sequence  aflct  50  Mentions. 


(hi 

Fig.  5.  Image  of  the  25<  x  IS4  pixels,  (s)  Original  Image.  <b)  Rtcon- 
stmesed  image  after  10  iteration!. 

In  addition  to  the  examples  showti  in  this  section,  are  have 
studied  a  number  of  other  examples.  From  these  examples, 
we  have  made  the  following  observations  about  the  iterative 
algorithm.  First,  for  sequences  satis  ying  the  uniqueness  con- 


scraints,  If  a  DFT  size  below  tome  threshold  value  it  used,  the 
algorithm  does  not  lead  to  the  desired  sequence.  The  threshold 
value  is  different  for  different  sequences,  and  w«  have  not  yet 
found  a  simple  way  to  determine  the  threshold  value  for  a 
given  sequence.  In  practice,  therefore,  the  DFT  size  is  typically 
much  larger  than  the  threshold  value  to  reconstruct  a  sequence 
from  its  signed  FT  magnitude.  Second,  the  DFT  size  required 
is  typically  much  larger  (by  more  than  a  factor  of  10  typically) 
chan  the  size  of  the  data  for  1-D  signals.  For  2-D  signals,  we 
have  observed  that  the  DFT  size  of  2 N  X  2 N  when  the  data 
size  is  N  XN  is  sufficient  for  all  examples  we  considered.  This 
difference  is  in  part  due  to  the  fact  that  the  magnitude  of 
2 A'  X  2N  DFT  when  the  data  size  is  N  X  N  uniquely  specifies 
a  2-D  sequence  within  a  sign  factor,  a  translation,  and  a  central 
symmetry,  and  therefore  the  ambiguity  that  needs  to  be  re¬ 
solved  by  I  bit  of  phase  information  is  much  less  for  2-D  sig¬ 
nals  than  for  l-D  signals.  Third,  the  threshold  DFT  length  is 
approximately  the  same  for  different  choices  of  a,  as  long  as  a 
is  not  too  close  to  0  or  n.  As  a  approaches  0  or  n,  the  thresold 
length  is  significantly  increased.  The  choice  of  a  *  si 2  permits 
the  use  of  FFT  routines  specific  to  real  sequences,  and  there¬ 
fore,  uses  less  computation  time  and  less  storage  space. 
Fourth,  che  convergence  rate  of  the  iterative  algorithm  is  rapid 
initially  and  becomes  slow  as  the  number  of  iterations  is  in¬ 
creased.  Fifth,  we  have  observed  that  the  mean  square  error 
between  the  original  and  reconstiucted  sequences  decreases 
monotonically  as  the  number  of  iterations  increases.  Sixth, 
the  convergence  rale  of  the  algorithm  can  be  significantly  im¬ 
proved  by  using  an  acceleration  procedure  similar  to  that  used 
by  Oppcrdirim  er  al.  |  13J.  Further  details  on  the  behavior  of 
the  iterative  algorithm  can  be  found  in  Van  Hove  (91. 

V.  Conclusions 

In  this  paper,  we  have  shown  that  a  1-D  or  MD  sequence  is 
uniquely  specified  urdcr  mild  restrictions  by  its  signed  FT 
magnitude.  In  addition,  we  have  developed  an  iterative  algo¬ 
rithm  to  reconstruct  a  1-D  or  MD  sequence  from  its  signed  FT 
magnitude.  When  this  resul*  is  combined  with  the  previous  re¬ 
sult  |S)  on  the  problem  of  reconstructing  a  1-D  or  MD  se¬ 
quence  from  its  FT  phase,  we  obtain  a  very  general  result  that 
a  1-D  or  MD  sequence  is  uniquely  specified  by  its  FT  phase  or 
its  signed  FT  magnitude.  In  addition,  under  mild  restrictions, 
an  iterative  algorithm  which  is  similar  in  style  can  be  used  to 
reconstruct  a  1-D  or  MD  sequence  from  its  FT  phase  or  signed 
magnitude. 

Appendix 

Statement  Al:  Let  x(n)  and  v(n)  be  two  real,  causal,  and 
finite  extent  sequences.  If  |Af(w)l  *  I  F(oj)1,  x(n)  and  y(n) 
can  always  be  expressed  as 

*(n) »  b(n)  •  «(n) 

y(n)  •  eb(n)  •  j(Af  -  I  -  n) 

where  c  ■  ♦!  or  - 1  and  a(n)  and  b(n)  are  real,  causal,  and 
finite  extent  with  N  corresponding  to  the  length  of  a(n),  i.e., 
«(n)  ■  0  outside  0  <  n  <  N  -  I. 

Proof:  A  general  expression  of  the  z  transform  X(z)  of  a 
sequence  x(n)  which  is  causa]  and  has  a  finite  support  is  given 


by 

jr(») n  (ai  d 

l-l 

where  zt,  i  *  1 , 2,  •••.{?.  are  the  2eros  of  3T(z),  x»  is  the  fust 
nonzero  sample,  and  tt ,  is  the  positive  initial  delay  in  x(n).  It 
is  well  known  that  the  FT  magnitude  of  a  finite  extent  1-D  se¬ 
quence  remains  unchanged  only  when  the  sequence  is  subject 
to  linear  shifts,  sign  inversions,  and/or  zero  “flipping."  The  z 
transform  V(z)  may  therefore  be  written  as 

y(z)-*z-"»x,  n  (• '  *(*■’)  n  (-**♦*■*) 

!€{»}  reW 

(AI-2) 

where  n,  is  the  positive  initial  delay  in  >-(n),  {r}  is  the  set  of  in¬ 
dexes  of  the  R  zeros  of  F(z)  which  are  zeros  of  A'fzl  reflected 
across  the  unit  circle,  a:;d  (it)  is  the  set  of  indexes  of  zeros 
which  are  unchanged  from  X(z)  to  >’(*)-  We  may  also  write 
(A!.l)and(A1.2)as 

Jf(z)  =  A(z)-B(z) 

Y(z)*±C(z)  B(z) 
or 

x(")  =  «(»)*  b(n) 

y(n)*tf(n)»b(n)  (Al  3) 

where 

^(z)«z-‘«i-"»>  n  (1  -  ztz~') 

«{') 

ff(z)«z"‘»x o  n  (I  ‘  *!*'') 

«={.} 

C(z)=  I]  (*.  +  0-  (A  14) 

We  now  show  that  c(n)  is  o(n)  time  reversed,  represented  by 
•'(■*)-  The  length  of  the  sequence  e’(n)  is  AT  *  nt  -  n2  ♦  R  +  1, 
if  we  include  the  leading  zeros.  Therefore, 

•  I  -  n) 

A’(z)  *  A(z',)z*<,v'  «  z‘*  I]  (I  -  Z|Z)»C(z) 

<«{') 

so  that  c(n)  =  e(Af-  I  -  n).  From  (A1.3),  the  sequences x(rr) 
and  y(n)  are  expressed  in  the  adequate  form.  To  characterize 
«(n)  and  b(n),we  examine  their  z  transforms.  Since  B(z)  con¬ 
tains  only  a  finite  number  of  negative  powers  of  z,  the  sequence 
6(n)  has  a  finite  causal  support.  Since  A(z)  and  A  ’(•*)  *  C(z) 
contain  only  negative  powers  of  z.  it  follows  that  a(n)  and 
*(/V-  I  -  n)  are  causal  so  that  a(n)  is  zero  outside  0<n< 
N-  I .  If  the  z  transform  X(z)  contains  a  pair  of  complex  con¬ 
jugal  e  zeros,  than  they  must  both  belong  to  {«}  or  both  to  {/} 
for  y(n)  to  be  real-valued.  The  z  transforms  A(z)  and  B(z) 
may  therefore  contain  complex  zeros  only  in  conjugate  pairs  so 
that  a(n)  and  bfn)  are  real,  in  the  case  rtj  >n , .  we  simply  ex¬ 
change  the  roles  of  x(n)  and  y(n).  Thu  completes  the  proof  of 
Statement  Al. 
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Statement  A2  Let  b(tt)  be  a  res!.  causal,  and  finite  extent 
sequence.  For  any  positive  integer  A,  the  equation 

Re{«e)a*(A'*"/,l,.,  M-0 

b  satisfied  for  at  least  P  distinct  values  of  or  in  the  interval  R 
where  P  and  R  are  as  defined  in  (7)  of  the  text. 

To  prove  the  statement,  we  introduce  the  notion  of  un¬ 
wrapped  phase.  Given  a  Fourier  transform  W(«)  which  has  no 
zeros,  we  define  its  unwrapped  phase  Par(ur)  as  the  unique 
continuous  function  of  os  which  satisfies 

Affor)  *  IM(u)le‘**M  (A2.I ) 

for  all  or  and  which  takes  the  value  of  0  or  -x  at  or  »  0.  The 
unwrapped  phase  has  the  following  properties.  If  we  define 
the  function  F(u)  as 

F(<o)  =  D(u)B(cj)  .  (A2.2) 

then  if  follows  that 

Pf(cj)  =  d0(or)  ♦  dfl(w)  ♦  2ar 


Fi t-f.  Unmapped  phase  of  the  tunznon  F(w)  lot  He)  •  *(»).  (j)  At  • 
4.  MM’S. 

Is  utisfied  for  at  least  P  distinct  values  of  or  in  the  interval  /*, 
then  it  is  identically  equal  tc  zero  and  e(n)  *  x(A  -  I  -  n).  P 
and  R  are  defined  as  in  (7)  in  the  text  as 

P  ■-*'-  *  and  ti  » (0, »)  for  A  odd 

yy 

^=y- andfl*(0,  xl  for A even 

Proof  /  »  A  Odd:  With  the  use  of  trigonometric  formulas, 
we  obtain 


where 

o=l  if  do(0)  =  dB(0)  =  -» 

0  otherwise.  (A2  3) 

The  unwrapped  FT  phase  ds(or)  of  a  causal  sequence  b(n) 
satisfies 

(A2.4) 

The  unwrapped  phase  of  the  function 

/>(or)  =  e*'w(A'*0'^  (A2.S) 

is 

♦oM  *  -or  — y- .  (A2.6) 

We  now  proceed  to  the  proof  of  statement  A2.  We  consider 
the  unwrapped  phase  pp{o>)  of  the  function 

The  equation  Re(F(or))  *  0  has  the  same  roots  is  the  equation 

py-(ui)  =  y +*x.  with  k  an  integer, 

since  F(ej)  has  no  zeros.  From  our  previous  discuvion,  we 
have 


G(w)=  lm(A(a))e/••'(•v•,w,) 

»  V  a(n)  sir  (~r—  *  «)  or  (A3.1) 

v  w,i  /TV- I  \  is-  I  \\ 

2;  *V"T~ 

(A3.2) 


Since  the  set  of  the  (A  -  I V2  functions  sin  to,  sin  2  w,  * ",  sin 
(A  -  I)tu/2  is  a  Chebyshev  set  on  the  interval (0.  x)  as  is  shown 
in  (9]  and  since  C(u»)  has  at  least  (A  -  l)/2  distinct  roots  in 
the  interval  (0,  x),  it  follows  that  the  coefficients  of  the  ex¬ 
pansion  in  the  right  hand  side  of  (A3.2)  must  vanish 


=  0; 


is  •  1,2,  ••  •, 


A-  I 
2 


or 

e(/r)«x(A- 1  -  n);  n  =  0,  ■  ,A.*  I. 


♦iK»)  *  df(0)  *  du(») '  da(0)  ♦  do(*J  *  dr/0) 


Since  the  continuous  function  dyfor)  decreases  at  least  by 
(A-  l)/2  x  on  the  interval  /?,  it  follows  that  the  graph  of 
dx-(oi)  crosses  at  least  A/2  lines  of  phase  x/2  ♦  kn  in  (0,  x|  if 
A  is  even  and  at  least  (A  -  I  )/2  such  lines  in  (0,  x)  if  A  is  odd. 
Fig.  6  shows  df(or)  when  6{n)  -  f  (n),  for  the  cares  A  ■  4  and 
A- 5. 

Statement  AS:  Let  a(n)  be  a  real  valued  sequenced  which  is 
zero  outside  0  <  «  <  A  -  1.  If  the  equation 

lmU(z)z<-w-'»'»:f.,Ao}«0 


When  A  is  even,  the  expansion  of  G(or)  is 

•sin  (n*  or. 

Since  the  functions  sin  or/2,  sin  3or/2,  •  •  • ,  sin  A-  1/2  or 
form  a  Chebyshev  set  on  the  interval  (0,  x]  as  is  shown  in 
(9) ,  it  follows  that 


n)-.(f-)  =  0;  n  »  0,  t ,  •  •  • ,  y  * 
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IHI 


or 


I  *  ");  ,N- l. 

Thii  completer  the  proof  of  Stalemenl  A3. 
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The  phsse-retiicval  problem  lor  discrete  multidimensional  Heidi  is  investigated.  In  particular,  a  recursive  proce¬ 
dure  is  developed  for  reconstructing  a  signal  from  the  modulus  of  its  Fourier  transform.  The  information  neces- 
aary  to  begin  the  recursion  is  the  boundary  values  of  the  signal.  Although  it  ia  not  arc-ays  possible  to  determine 
these  boundary  values  from  Fourirr  modulus  data  only,  if  the  sequence  has  a  reg:on  of  support  with  a  certain  geom¬ 
etry  then  these  boundary  values  can  be  determined.  These  geometries  represent  t  generalisation  of  the  conditions 
for  off-axis  holography. 


1.  INTRODUCTION 

The  reconstruction  of  a  signal  from  the  magnitude  of  its 
Fourier  transform,  generally  referred  to  as  the  phase-retrieval 
problem,  arises  in  a  variety  of  different  contexts  and  appli¬ 
cations  and  within  such  diverse  Fields  as  crystallography,  as¬ 
tronomy,  optics,  and  signal  processing.1-1  There  are  three 
fundamental  issues  involved  in  the  phase-retrieval  problem: 
the  uniqueness  of  the  solution,  the  development  of  algorithms 
for  reconstructing  a  signal  from  the  magnitude  of  its  Fourier 
transform,  and  the  sensitivity  of  the  reconstruction  to  mea¬ 
surement  errors  and  computational  noise.  In  this  paper  at¬ 
tention  is  focused  on  the  reconstruction  problem.  More 
specifically,  following  a  brief  review  in  Section  2  of  some  recent 
results  concerning  the  uniqueness  oi  the  solution  to  the 
phase-retrieval  problem  for  discrete  two-dimensional  signals, 
a  recursive  solution  to  the  pnase-retrieval  problem  is  devel¬ 
oped  in  Section  3.  This  recursive  algorithm  is  similar  to  other 
phase-retrieval  algorithms  in  the  sense  that  some  signal  in¬ 
formation,  other  than  the  magnitude  of  ita  Fourier  transform, 
is  assumed  to  be  known.1  -*  Specifically,  this  recursive  algo¬ 
rithm  assumes  knowledge  of  what  we  presently  define  as  the 
boundary  value,  of  the  signal.  Although  it  it  not  always  the 
case  that  the  boundary  values  of  a  two-dimensional  signal  are 
known,  it  is  shown  in  Section  4  that,  in  some  cases,  the 
boundary  values  of  a  signal  may  te  determined  from  the  given 
Fourier-transform  magnitude  information.  In  particular,  it 
is  shown  that  if  a  two-dimensional  sequence  has  a  region  of 
support  with  a  certain  geometry,  then  the  boundary  values  of 
the  sequence  may  be  easily  recovered.  These  geometries 
represent  a  generalisation  of  the  conditions  for  oft-axis  ho¬ 
lography. 


2.  PHASE  RETRIEVAL 

In  order  to  develop  the  recursive  phase-retrieval  algorithm 
in  Section  3,  tome  notation  and  terminology  related  to  discrete 
two-dimensional  signals  are  necessary.  The  required  back¬ 


ground  it  therefore  provided  in  Section  2.  A.  In  addition,  some 
recent  refills  concerning  the  uniqueness  of  the  solution  to  the 
phase-retrieval  problem  are  briefly  reviewed  in  Section  2.B. 

A.  Notation  and  Terminology 

A  two-dimensional  sequence  is  a  function  of  two  integer 
variables  m  and  n,  which  is  denoted  by  x(m,  n).  The  two- 
dimensional  z  transform  of  x(m,  n)  is  denoted  by  XUi,  tj) 
and  is  defined  by 

XUi,r2)~  £  £  x(m. n)2i""r J-",  fl) 

where  z  i  er.d  r  2  are  complex  variables.  The  two-dimensional 
Fourier  transform  of  x(m,  n)  is  equal  to  the  i  transform  of 
x(nt,  n)  evaluated  a'ong  the  unit  bi-disk  |-e i |  “  |r2|  »  1  and 
is  given  by 

X(e>“‘.t,',t  *  £  £  (2) 

where  ui  and  u>j  are  real  variables  that  represent  the  spatial 
frequencies  of  the  two-dimensional  Fourier  transform. 
Written  in  polar  form,  X («’'■'>,  r'»»)  it  expressed  in  terms  of 
its  magnitude  and  phase  as 

A'fe'-i,  e'-»)  "  e"')|e"*,-'1'*'*>.  (3) 

Thus  tbe  phase-retrieval  problem  is  concerned  with  the  re¬ 
covery  of  z(m,  n)  given  only  the  spectral  magnitude  function 
|.V(e">,  e'-')| 

The  two-dimensional  sequences  considered  in  this  paper 
are  assumed  to  be  real  valued  and  to  have  finite  support.  i.e., 
x  (m,  n )  is  real  and  nonzero  for  only  a  finite  number  of  values 
of  the  ordered  pair  (m,  n).  For  convenience  it  is  assumed, 
without  any  loss  in  generality,  that  a  sequence  with  finite 
support  has  first  quad.ant  support,  Le.,x(m,n)  “  0 if  m  <  0 
or  if  n  <  0.  In  addition,  if  it  is  known  that  x(m,  n)  is  rero 
outside  the  rectangular  region  R<  Af ,  ,V)  containing  all  points 
(m,  n)  for  which  0Sm<M  end  0  S  n  <  N,  i.e., 
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X(m.ao*(o,m-i:x|3.a.,-U.  m> 

where  X  is  used  to  denote  the  Carteaian  crow  product*,  eg., 
then  x(m,  nl  i*  said  to  have  aupport  H(M,  N) 

Since  the  two-dimensional  z  tnenfoc®  of  a  sequence  with 
first  quadrant  aupport  i*  a  polyrv:  ial  in  the  two  varieblea, 
a i_l  and  rj"1,  X(*i. r j)  may  alwayo  t;#  uniquely  written  (to 
within  factors  o!  aero  degree)  as  a  piucV  of  polynomials  that 
are  irreducible  over  the  field  of  comp;  .*?  turn  hers10 

X(zi,  zj)  “  nri"">r j""*  11  Xe<*i.*z).  (5) 

a- 1 

whet  e  or  is  a  real  number  and  nt  and  /it  are  nonnegative  in¬ 
tegers.  The  irreducible  factors  X»(Z|,  r2),  which  may  be  of 
arbitrarily  large  degree,  are  the  two-dimensional  counterpart 
of  the  linear  factors  that  define  the  zero*  of  the  :  transform 
of  a  one-dimensional  sequence.  Clearly,  since  these  irre¬ 
ducible  factors  are  polynomials  in  two  variables,  the  zero  seta 
of  two-dimensional  z  transforms  are  contours  in  the  rj-Zj 
plane. 

B.  Uniqueness 

An  important  issue  in  the  phase-retrieval  problem  is  the  un¬ 
iqueness  of  the  solution,  ft  is  well  known  that,  without  any 
additional  information  or  constraints,  a  signal  (discrete  or 
continuous,  one-dimensional  or  multidimensional)  it  not 
uniquely  specified  by  the  magnitude  of  its  Fourier  trans¬ 
form.1^'11  The  absence  of  a  unique  solution  stems  from  the 
fact  that  it  is  always  possible  to  convolve  a  signal  with  an  ar¬ 
bitrary  all-pas*  signal  (one  that  has  a  Fourier  transform  with 
unit  modulus)  to  obtain  another  signal  with  the  same  spectral 
magnitude.  As  a  result,  the  ability  to  incorporate  some  ad¬ 
ditional  information  or  'cnowledge  about  the  signet  to  con¬ 
strain  the  set  of  admissible  solutions  is  necessary  in  order  to 
obtain  a  unique  reconstruction.  Since  many  of  the  signals 
that  are  of  practice!  interest  are  of  finite  duration  or  extent, 
a  finite  support  constraint  is  often  used  in  phase-retrieval 
algorithms.1^*  As  a  result,  the  uniqueness  of  the  solution  to 
the  phaae-retrit  -al  problem  has  been  considered  for  the  case 
in  which  the  solution  is  constrained  to  be  ol  finite  length  or 
to  have  finite  support.  Unfortunately,  however,  it  has  been 
shown  that  for  one-dimensional  signals  (either  continuous  or 
discrete)  such  e  constraint  is  not  sufficient  to  ensure  t  unique 
solution  because  of  the  possibility  of  zero  flipping.**' 1  For 
two-dimensional  signals  with  finite  support,  on  the  other 
hand,  the  uniqueness  results  ere  considerably  different.  Al¬ 
though  the  uniqueness  properties  are  not  well  understood  for 
the  continuous  case,  considerable  progress  has  been  made  for 
the  discrete  case.  In  perticultr,  it  has  been  shown  that  the 
two-dinwnsionsl  counterpart  of  zero  flipping  in  the  discrete 
one-dimensional  case  is the  flipping  of  the  zero  contours  of  the 
irreducible  polynomials  that  define  the  two-dimensional  z 
transform  of  the  sequence. J  It  follows  therefore  that.  if  X(zi, 
zj)  is  an  irreducible  polynomial,  then  i  (m.  n )  is  uniquely  de¬ 
fined  by  its  spectral  magnitude  to  within  the  trivial  am¬ 
biguities  of  a  linear  shift,  a  reflection  cf  the  sequence  about 
the  origin,  or  by  a  scale  fector  of  (-1).  More  specifically,  note 
that,  if  twoaequencts  i(s,  n)  and  y(m,  n)  arc  related  by 

yfm.n)  *  ±i(±m  +  *,  ±n  +  f)  (6) 

for  some  integer*  k  «nd  I,  then  x(m,  n)  and  y(m,  nl  h»v» 
Fourier  transforms  with  the  same  magnitude.  Therefore  any 
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two  sequsoce*  related  by  Eq.  (S)  are  said  to  be  squ-valsat,  and 
this  equivalence  relation  is  denoted  by 

x(m,/i)~-y(m,n).  (7) 

With  this  relation,  the  uniqueness  result  of  interest  is  the 
following3: 

Theortm  1:  Let  x(m,  n)  be  a  two-dimensional  sequence 
with  finite  support  that  has  t  two-dimensional  z  transform 
that,  except  for  trivial  factors  of  the  form  aZ|~*‘ij~*».  it  ir¬ 
reducible.  If  y  (m,  n)  is  another  two-dimensional  sequence 
aoth  finite  support  with  |  Vie'*1.  */-»)|  m  |X(c1*'1.c"»;|  for 
all  ui  and  s>j,  then  y(/n,  n)  ~  x(m.  n). 

It  should  be  pointed  out  that  the  requirement  that  X(*i, 
Z2>  be  irreducible  is  not  i  particularly  strong  constraint. 
Specifically,  it  mey  be  shown  that  within  the  set  of  ell  two- 
dimensional  sequences  with  finite  support  the  sulieet  of  all 
sequences  that  bsv*  reducible  z  transforms  is  a  set  of  measure 
zero.3-13  As  r  result,  almost  all  two-dimensional  sequences 
with  finite  support  will  satisfy  the  irreoucibility  requirement 
of  Theorem  1.  frreducibility  of  the  z  transform  of  a  two- 
dimensional  sequence  mey.  iu  fact.  b»  guaranteed  with  the 
proper  placement  of  point  source*  outside  the  sequence's  re¬ 
gion  of  support.11 

One  limitation  of  Theorem  1  is  that  it  require*  that  the 
magnitude  of  the  Fourier  transforms  of  i(m,n  I  and  y(m,n) 
be  equal  for  all  values  of  U|  and  sjj.  Fortunately,  however. 
Theorem  1  mey  be  extended  so  that  the  magnitudes  of  the 
Fourier  transforms  of  x(m,  n)  and  >  (m.  n )  need  only  be  equal 
for  a  finite  number  of  values  of  u>t  end  vj.  The  number  of 
points  for  which  the  Fourirr-transform  magnitudes  musi  be 
equal  ia  determined  by  the  size  of  the  regions  of  support  of 
x(m,  n)  and  yfin,  n),  whereas  the  locations  of  the  sample 
points  in  the  -m-**;  plane  are  constrained  to  lie  on  a  regular 
lattice.  Specifically3: 

Theorem  2:  Lei  i (m  n)  and  ytm,  n)  be  two-dimensional 
sequences  with  support  fl(.tf,  ,V).  If  a*  for  k  «  1, ....  M  and 
hi  for  1  »  1, . . . ,  N  are  distinct  real  numbers  in  the  interval 
<0,  xland  if 


|.V(e">,  e"»)|  -  |y<f»«>.ere»)|  for 

«z  “  hi,  h*. . . . .  t# 

(8) 

then  y(m,n)~x(m,  n). 

A  ip  jciel  case  of  this  theorem  result*  when  the  points  a*  and 
h|  are  uniformly  spaced  between  0  and  tr.  In  this  instance  in 
particular,  the  condition  contained  in  Eq.  (8)  is  equivalent  to 
the  constraint  that  the  magnitude  of  the  2 M  X  2 .V  point 
two-dimensional  discrete  Fourier  transforms  of  x(m,  n)  and 
y(m,  n)  art  equal. 

3.  RECUFSIVE  PHASE  RETRIEVAL 

As  was  stated  in  Section  2,  there  exists  a  rich  end  useful  dose 
of  two-dimensional  sequences  that  are  uniquely  defined  to 
within  some  trivial  ambiguities  by  the  magnitudes  of  their 
Fourier  transforms,  eg.,  the  clast  of  two-dimensional  se¬ 
quences  that  Imve  finite  support  and  irreducible  z  transforms. 
In  spite  of  this  uniqueness  result,  however,  the  reconstruction 
of  a  two-dimensional  sequence  from  its  spectral  magnitude 
remains  a  difficult  problem  in  the  absence  of  any  additional 
information  or  constraints.  Therefore  a  number  of  different 
algorithms  have  been  proposed  that  incorporate  additional 
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signal  information  or  conet/ainU.  Gerthberg  and  Saxton,  for 
tiampU,  developed  an  iterative  algorithm  that  aisumes,  in 
addition  to  apcclra!  magnitude,  information  about  the  mag¬ 
nitude  of  the  sequence  s(m,  n),  which  wee  assumed  to  be  a 
complex-valued  function  of  m  and  n  .*  Fienup,  on  the  other 
hand,  has  considered  an  iterative  algorithm  that  incorporates, 
in  addition  to  a  finite  support  constraint,  a  positivity  con¬ 
straint  on  x(m,n|*  As  yet  another  example,  Hayes1  and  Van 
Hove  el  of.  *  have  investigated  iterative  phase-retrieval  algo¬ 
rithms  from  signed  Fourier-transform  magnitude,  t.e.,  Fou- 
rier-tra.isform  magnitude  akng  with  one  bit  of  phase  infor¬ 
mation.  In  this  section,  a  recursive  solution  to  the  phase- 
retrieval  problem  is  developed  for  reconstructing  a  two-di¬ 
mensional  sequence  from  its  two-dimensional  autocorrelation 
function  rim.  n)  when  the  boundary  values  of  x(m,  n)  are 
known.  Thus  this  algorithm  is  similar  to  those  mentioned 
above  in  that  some  information  in  addition  to  the  Fourier- 
transform  magnitude  is  assumed  to  be  known  about  x(m,  n). 
In  this  case,  the  additional  information  that  is  included  con¬ 
sist*  of  the  boundary  values  of  x(m,  a ). 

A.  Development  of  the  Algorithm 
Consider  an  arbitrary  two-dimensional  sequence  x(m,  n) 
whose  nonzero  values  are  contained  within  the  rectangular 
region  R(Af.A').  as  shown  in  Fig.  1(a).  For  convenience,  it  is 
assumed  that  ItlM.  N)  is  the  smallest  possible  rectangle  that 
contains  alllhe  nonzero  values  of  x(m,  n|.  Therefore  along 
each  edge  of  R(Af .  IV)  there  is  at  least  one o.dered  pairlm.n) 
for  which  r(m.  n )  is  nonzero.  The  boundary  of  x(m,  n)  is 
therefore  defined  as  the  collection  of  all  the  points  of  x  (m ,  n ) 
that  lie  along  the  edges  of  R(Af .  N). 

The  autocorrelation  of  x(m,  n).  denoted  by  r(m,  n),  is  given 
by 

r(m.n)  -  x(m.n)  ••  x(-m.-n) 

St-I  N-l 

-EE  x(*,f)K(m  +  *.*t  +  /),  (9) 

*-0  1-0 

where  x  *  •  y  is  used  to  denote  the  two-dimensional  convolu¬ 
tion  of  x  ai  i  y.  Knowledge  of  the  squared  magnitude  of  the 
Fourier  transform  ofx(m,  n(  is  equivalent  to  knowledge  of  the 
autocorrelation  r(m,  n)  since  they  form  a  Fourier-transform 
pair.  Clearly,  the  support  of  rlm,  n)  is  contained  within  the 
rectangular  region  defined  by  (—A/  -f  l ,  Af  -  I|x(-/V  +  1, 
N  —  1|.  as  shown  in  Fig.  Kb).  Furthermore,  since  t(m.n)  is 
real,  rim,  n)  is  symmetric  about  the  origin,  i.e..  rlm,  n)  » 
r(-m,  -n). 

In  addition,  note  that 

rim.K-  1)  -  *Elx(*,0)  x(m  +  *,,V  -  1) 

*-o 

-  x(m.O)  •  t(-m,N  -  1)  (10a) 

and 

A'-l 

r(M-I,n)«  £  x(0,(>  X  (Af  -  J,n  + 1) 
i-o 

-*(0,n).x(Af- !.n).  (10b) 

where  x  •  y  it  used  to  denota  the  one-dimensional  convolution 
ofx  andy.  With  the  boundary  of  r{m,n)  defined  as  the  col¬ 
lection  of  all  the  points  of  r(m ,  n )  that  lie  along  the  edges  of 
iu  tegion  of  support,  note  that  Eqs.  (10)  assert  that  the 
boundt /y  values  of  rim,  n)  may  be  determined  front  the 
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Fig.  1.  la)  A  region  ol support,  KIM,  N ).  for  a  two-dimensional  se¬ 
quence.  (b)  The  region  of  support  of  its  autocorrelation. 


boundary  values  of  the  sequence  xtm, «).  The  recovery  of 
the  boundary  value*  of  iim,  n)  from  the  boundary  values  of 
r(m,n ).  however,  is  a  nonlinear  problem  that,  in  the  absence 
of  any  additional  information,  may  not  have  a  unique  solution. 
Suppoae,  however,  that  the  boundary  values  of  x(m,  n)  are 
known  (the  determination  of  the  boundary  values  from  r(m, 
n)  is  addressed  in  Section  4|.  More  specifically,  for  k  •  0. 
1 . A'  -  I.let 

x»(m)-x(m,hl  form -0,1 . Af  -  1  (11) 

be  used  '.o  denote  the  one-dimensional  sequence  that  corre¬ 
sponds  to  the  fcth  row  of  the  two-dimensional  sequence  x(m, 
n  1  as  shown  in  Fig.  1 .  The  boundary  values  of  x  (m ,  n )  thus 
include  the  first  and  the  last  rowi  of  x(m,  n  I,  which  are  de¬ 
noted  by 

<z(m)  -  x0(m).  d(m)  -  x«_i(mi,  (12) 

as  well  a*  the  first  and  the  last  columns  of  x(m,  n).  which 
correspond  to  the  first  and  last  values  of  each  sequence  x*  (m ), 
i.e.,  x»(0)  and  x»(Af  -  l).  Now,  with  r*(m)  »  rim.  k)  used 
to  denote  the  kth  row  of  the  autocorrelation  sequence,  as 
shown  in  Fig.  1(b),  note  that 

£  x,v-t(*)o(m  +  A)  +  E  d(*)xi(m  +  k)  -  r.v-;(m) 

*-o  *-t 

(13a) 

or 

Xjv-j(m)  •  r>(— m)  +  dim)  •  X|(-m)  -  ^.j(m).  03b) 

(R»c  ’ll  that  •  denotes  convolution.)  Therefore,  with  aim ), 
dim),  and  r.v-jlm)  known,  Eqs.  03)  represent  a  set  of  2M  - 
1  linear  equations  in  the  unknowns xf(m)  and  x.v-j(m),  i.e.. 
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Fie.  2.  System  interpretation  of  the  linear  equetiuru  that  define  the 
recursive  phase  retrieval  algorithm. 


the  values  of  x  (m.  n)  in  rows  I  and  S  -  2.  A  system  inter¬ 
pretation  of  the  set  of  linear  equations  given  by  Eqs.  (13)  is 
shown  in  Fig.  2.  Specifically,  Eqs.  (13)  define  the  sequence 
r.v-  2(m  I  as  the  sum  of  the  outputs  of  two  linear  shift-invariant 
(one-dimensional)  systems  with  unit  sample  responses  o(-m) 
and  (3<m)  that  are  driven  by  the  inputs  x.v-.lm )  and  xt(-m ), 
respectively.  The  goal  is  to  recover  the  unknown  values  of  the 
signals  x.v-.lml  and  X|(-ml  from  the  available  known  in¬ 
formation,  i.e.,  from  the  signal  r(m,  n)  and  the  boundary 
values  of  x(m,n)  Recall,  however,  that  the  boundary  values 
of  x  (m ,  n )  include  the  first  and  the  last  rows  of  x  (m ,  n  I,  which 
correspond  to  the  unit  sample  responses  of  the  two  filters  in 
Fig.  2,  as  well  as  the  first  and  the  last  columns  of  x(m,  n), 
which  define  the  initial  and  the  final  values  of  the  inputs  to 
these  filters,  i.e.,  Xi(0).  x((Af  -  1 ).  x/v-j(0).  and  x.v-i(Af  - 
1). 

In  order  to  investigate  the  solution  to  Kqs.  (13),  let  us  in¬ 
troduce  the  vector  notation 

*„  -|x,(0).x„(ll . x„fAf-l)|.  (Ma) 

r.  -  [r„(l  -  Af),  r„(2  ~  Af) . r„(Af  -  l)|.  (Mb) 

Thus  Eqs.  (13)  may  be  written  in  matrix  form  as 

[x.v-,] 


IC  ; 


f.v-s! 


where  A  and  B  are  (2A f  -  1)  X  Af  convolution  matrices.  As 
an  example,  for  a  sequence  with  support  ft(3, 3),  i.e.,  xlm,  n) 
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o(m)  and  xn-i(ai)  «  Bim).  the  first  two  rows  and  the  last  two 
rows  of  x(m,  n)  are  now  specified. 

Now  suppose  that  the  first  (*  -  1)  rows  and  the  last  (*  - 
1 )  rows  of  x  (m,  n)  are  known,  i.e.,  aim )  and  t)( m  >  along  with 
X{(m)andx/v-i-i(m)forf  “  1,2,...,*  -2  Then,  as  in  Eqs. 
(13),  a  set  of  linear  equations  defines  the  unknown  values  in 
the  aequnces  x»-i(m)  and  xw-*(m>.  Specifically, 

rn~i i(m)  ”  x*-*(m)  •  a(-m)  +  Bim)  •  x*-i<~m) 


*-Z 

+  £  XM-»v|(m)  aX|(-m), 
i- 1 


(17) 


which  may  be  rewritten  as 

xw-*(m)  v  a(-m)  +  0(m)  »x*-i(-m)  «  f.v-*(m),  (18) 

where 


fn-t(m)  ■ 


rjv_*(m)-  £  x/v_»*i(m)  •  xi(-m)  (19) 


is  a  vector  consist  ing  of  known  autocorrelstion  values  r.v-*  ( m ) 
and  sums  of  correlations  of  previously  computed  rows  of  x(m, 
n).  In  matrix  form,  Eq.  (18)  becomes 

I  ,20) 

where  the  matrices  A  and  B  are  identical  to  those  in  Eq.  (15). 
Thus  Eq.  (20)  provides  a  recursion  for  computing  the  rows 
x»-i  and  x.v-*  from  the  values  of  X|  and  x.v-tci  for  1  *  1, 

2 . *  -  2.  The  initial  conditions  required  to  begin  the 

recursion  are  the  first  and  the  last  rows  of  xlm,  n),  i.e.,  trim) 
and  0{n ).  Therefore,  given  the  boundary  values  of  x  (m ,  n ), 
the  entire  two-dimensional  sequence  may  be  recovered  from 
its  autocorrelation  function  by  using  the  linear  recursion  |Eq. 
(20)).  provided  that  the  linear  equations  may  be  uniquely 
solved  for  the  unknown  rows.  It  may  be  shown,  however,  that 
a  sufficient  condition  for  a  unique  solution  to  Eq.  (20)  to  exist 
is  that  <r(m)  and  Bin)  not  be  identically  zero  and  that  aim) 
(15)  notbe  related  to  BiM  -  1  -m)  by  a  constant  scale  factor.  In 
this  case,  the  unknowns  in  Eq.  (201  may  be  recovered  by  a 
pseudoinverse  matrix  operation.  One  interesting  feature 
about  the  recursion  that  should  be  pointed  out  is  that  it  re- 
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Note  that  XifOI. 

x,(3),x 

2(0),  and  X2(3)  are 

boundary  values - 

of  x  (m,  n )  and  thus  are  assumed  to  be  known.  Therefore  Eq. 
( 16)  representa  seven  linear  equations  in  four  unknowns.  In 
the  general  case,  there  ere  2M  coefficients  in  Eq.  (15)  that  are 
required  in  order  to  specify  the  vector*  it  and  x.v-?.  The 
boundary  values  of  xlm,  n),  however,  define  the  initial  and 
the  final  values  of  these  vectors.  Consequently,  Eq.  (15) 
re  presents  2Af  -  1  linear  equations  in  2Af  -  4  unknown*  For 
the  moment,  it  is  assumed  that  these  equations  msy  be 
uniquely  solved  for  x(  and  x*-j.  Thus,  including  x0(m)  - 


.  (16) 


quires  the  computation  of  only  one  pseudoinveree  matrix. 
The  recursive  solution  for  each  row  consists  simply  of  the 
computation  of  the  vector  fjv-t  in  Eq.  (201,  which  is  then 
multiplied  by  the  pseudoinverse  matrix. 

B.  An  Example 

An  example  that  illustrates  the  recursive  reconstruction  of  a 
two-dimensional  sequence  from  its  autocorrelation  function 
and  its  boundary  values  is  shown  in  Fig.  3.  In  particular,  an 
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Fig.  3.  Phut  retrieval  using  known  boundary  condition!,  (a) 
Original  image,  (b)  Keconatructed  image. 

original  two-dimensional  sequence  that  has  a  rectangular 
region  of  support  of  extent  64  pixels  by  64  pixels  is  shown  in 
Fig.  3(a).  The  sequence  that  is  obtained  from  the  recursion 
(Eq.  (20)]  by  using  double-precision  arithmetic  is  shown  in  Fig. 
3(b)  and  is  indistinguishable  from  the  original.  Although  the 
recursive  phase-retrieval  algorithm  successfully  reconstructed 
the  two-dimensional  sequence  in  this  example,  this  is  not  al¬ 
ways  the  case.  In  particular,  although  it  has  been  observed 
that  the  recursion  is  well  suited  for  reconstructing  two-di¬ 
mensional  sequences  that  have  small  region*  of  support,  e.g., 
ft(M,  N)  with  M  <  64  and  N  <  64.  because  of  the  recursive 
nature  of  the  algorithm  the  teconstruction  is  quite  sensitive 
to  errors  that  arise  from  computational  noise.  Specifically, 
whereas  the  reconstruction  of  large  two-dimensional  se¬ 
quences  is  accurate  in  the  initial  stages  of  the  recursion,  the 
propagation  of  computational  noise  through  the  recursion 
decreases  the  accuracy  of  the  reconstruction  at  the  recursion 
progresses.  Nevertheless,  in  reconstructing  a  two-dimen¬ 
sional  sequence  that  has  a  large  region  of  support,  it  is  possible 
to  consider  using  the  recursion  to  reconstruct  a  small  number 
of  roam  and  columns  (which  may  be  done  with  a  high  degree 
of  accuracy)  and  then  use  an  iterative  procedure  in  the  etyla 
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of  Carchbarg  and  Saxton  that,  in  the  spatial  domain,  incor¬ 
pora  tea  the  known  boundary  values  and  the  recursively 
computed  rows  and  columns. 

4.  COMPUTATION  OF  THE  BOUNDARY 
CONDITIONS 

Aa  was  noted  in  Section  3-A,  the  boundary  values  of  x(m.  n) 
art  related  to  the  boundary  values  of  the  autocorrelation 
function  rim.  n)  through  a  art  of  nonlinear  equations  jEqa. 
(10)1.  Although  it  hat  been  demonstrated  that  the  solution 
to  these  aquations  is  not  necessarily  unique,  there  are  cases 
for  which  the  solution  is  unique  and  for  which  the  boundary 
values  of  x(m,  n)  may  easily  be  determined.  Consider,  for 
example,  a  two-dimensional  sequence  x(m,  n)  that  is  known 
to  have  a  triangular  region  of  support,  at  shown  in  Fig.  4.  The 
region  of  support  of  the  autocorrelation  function  of  x(m.  n) 
is  also  shown  in  Fig.  4.  Note  that  the  three  corner  points  of 
x(m,  n)  are  related  to  one  another  by  the  following  three 
second-order  equationa: 

r(Af,0-x(Af.O)x(0.0>. 

r(0.  N)  •  x(0,  N)x(0, 0), 

r(Af,  -N)  •  x(0,N)x(M,0).  (21) 

By  assuming  that  x(0, 0),  x(M,  0),  and  x(0,  N)  are  nonzero, 
the  solution  to  Kqs.  (21 )  is  easily  shown  to  be  unique  to  within 
a  sign.  F urthc rmore.  once  these  corner  points  are  found,  the 
entire  boundary  ofx(m,  n)  may  easily  be  recovered  since  the 
boundary  values  of  x(m,  n)  are  proportional  to  the  boundary 
values  of  r(m,  n )  e.g  ,  x(m,  0) «  r(m.-N)/i(0,  N)  for  m  »  0. 
I, . . .  ,  M.  Therefore  two-dimensional  sequences  that  are 
known  to  have  a  triangular  region  of  support  may  be  easily 


Ibl 

Fig.  4.  (s)  A  triangular  region  of  support  for  a  two-dimentiontl  M- 
quenre.  (b)  Tbs  region  of  support  of  its  autocorrelation. 
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Fig.  5.  The  division  of  the  two-dimension*!  plane  into  eight  regions 
and  the  rectangular  region  HiM.N). 

reconstructed  from  only  their  autocorrelation,  provided  that 
the  amplitudes  of  the  corner  points  are  nonzero. 

Sequences  with  a  triangular  region  of  support,  however,  are 
a  special  case  of  a  more-general  class  of  sequences  for  which 
the  boundary,  and  hence  the  entire  sequence,  may  be  recon¬ 
structed  from  its  autocorrelation.  In  particular,  consider  a 
two-dimensional  sequence  that  i,.«s  a  rectangular  region  of 
support  R(Af.  N)  and  suppose  that  the  remaining  two-di¬ 
mensional  plane  is  divided  into  the  eight  regions  shown  in  Fig. 
S.  It  is  well  known  from  off-axis  holographic  techniques  that 
the  incorporation  of  a  point  source  sufficiently  far  removed 
from  the  region  of  support  of  x(m,  n)  will  allowx(m.n)  to  be 
reconstructed  to  within  a  scale  factor  from  its  autocorrelation. 2 
In  particular,  if  pint,  n)  «  Aim  —  k,n  —  /)  it  a  point  source  at 
m  mk,n  m  /.and  if*  £  2M-  I  or  iff  >  2N  —  1,  then  xim.nl 
may  be  trivially  reconstructed  from  rim.  n  >.  It  is  not  neces¬ 
sary.  however,  that  the  point  source  pirn,  n)  satisfy  this  sep¬ 
aration  constraint.  Suppose,  for  example,  that  pirn ,  n)  is  a 
unit  modul'is  point  source  that  lies  somewhere  wilhin  region 
1,  IH,  V.or  VII.  To  be  more  specific,  let  us  assume  that  p<m, 
n)  lies  in  region  I  and.  in  particular,  that  pirn,  n  1  ■  Aim  -  M, 
n  -  N ),  as  shown  in  Fig.  6(a).  In  this  case,  the  two  edge*  x  (0, 
n)  and  xim.O)  of  xim,  n)  (illustrated  in  Fig.  S  by  the  shaded 
region)  are  easily  recovered  from  r  (m ,  n).  In  particular,  note 
that 

r{U,N  -  n)  ■  x(O.n)  for  n  “  0, 1... .  ,N  -  1 

(22a) 

and 

r(Af  —  m,N)  •  x(m.O)  form  « 0, 1 . M  —  1. 

(22b) 

Therefore  both  of  these  edges  of  x(m,  n)  correspond  to  the 
edges  of  the  autocorrelation  sequence  r(m,  n>  With  these 
edges  of  xim,  n)  determined,  it  then  follows  from  Fx)s.  (10) 
that  the  remaining  boundary  values  of  xim,  n)  may  be  found 
by  a  simple  deconvolution  or  an  inverse  filtering  operation. 
Therefore,  if  a  point  source  of  known  amplitude  is  situated 
anywhere  within  one  of  the  four  quarter-planes  defined  by 
region  I,  III,  V,  or  VII,  it  follows  that  the  boundary  of  xim,  n  1 
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may  be  uniquely  determined  from  the  autocorrelation  r(m, 
n ),  and  therefore  it  follows  from  the  results  of  Section  3. A  that 
xim,  nl  may  be  recursively  reconstructed  from  rim.  n).  It 
is  interesting  to  note  that,  for  the  cut  in  which  the  point 
source  is  situated  at  iM.  N),  Fiddv  et  al. 11  have  shown  that 
the  s  transform  of  the  two-dimensional  sequence  (including 
the  point  source)  is  an  irreducible  polynomial  provided  that 
x(M  -  1, 0)  is  nonzero.  Therefore,  according  to  Theorems 
1  and  2,  a  unique  solution  is  guaranteed.  Note  also  that  in  this 
case  the  amplitude  of  the  point  source  pirn,  n )  need  not  be 
known.  Specifically,  aa  in  Eqs.  (21).  pim,  n)  and  the  three 
remaining  comer  points  of  x(m,  n )  are  related  by  a  set  of  four 
second-order  equations  that  may  be  uniquely  solved  for  the 
unknowns,  provided  that  they  are  nonzero. 

Consider  now  the  cate  in  which  a  point  source  lies  in  region 
I),  IV,  VI,  or  VIII.  Unlikethecasedescribedsbove.it  is  not. 
in  general,  possible  to  recover  the  boundary  of  xim,  n)  from 
the  autocorrelation  sequence  rim.n).  For  example,  consider 
a  unit  modulus  point  source  in  region  11  situated  at  (mo.  N). 


Ibl 

Fig.  6.  Point  sources  sufficient  fos  the  determination  of  the  boundary 
values  of  a  two-dimensional  sequence,  xtm,  nt.  (a)  A  single  point 
source  in  region  1  at  IM,  St  <bl  Two  point  sources,  one  in  region  II 
at  (mo.  N)  and  one  in  region  VIII  at  iM,  no). 
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In  thia caw,  the  Tint  row  of  tim.n )  it  eacily  derived  from  r(m, 
n)  linco 

r(«o  -  m.N)  “  x (m.  0)  for  m  »  0, 1, . . . ,  Af  -  1. 

(23) 

However,  without  any  additional  information,  no  other 
boundary  values  of  i(m,  n)  may  be  determined.  If,  on  the 
other  hand,  there  are  two  point  sources,  one  in  region  11  and 
one  in  region  VIII.  then  the  complete  boundary  may  be  re¬ 
covered  from  r(m,  n).  More  specifically,  consider  the  two 
point  sources  shown  in  Fig.  6(b)  that  are  located  at  (mo,  N) 
and  (Af,  n»).  If  these  point  sources  have  known  intensities, 
then  it  follows  that  the  first  row  and  the  fust  column  of  x  (m. 
n)  may  be  found  from  r(m,  n).  For  example,  if  the  point 
sources  are  of  unit  modulus,  then 

rlmo- nt.N)  “  i(m.O)  form- 0,1 . Af  -  1 


r(Af,no-n)“i(0, n)  torn1 


Note  that  the  caw  in  which  x(m,  n)  has  a  triangular  region  of 
support  corresponds  to  the  case  m0  -  n0  *  0  above  for  which 
one  point  source  is  located  in  region  II  at  (0.  Ni  and  where  one 
point  source  is  located  in  region  VIII  at  (Af,  0). 

Although  it  appears  that,  by  adding  point  sources,  we  have 
deviated  from  the  problem  originally  addressed  in  Section  3. 
the  addition  of  point  sources  is,  in  reality,  an  indirect  way  of 
defining  a  wt  of  known  boundary  conditions.  To  be  more 
specific,  let  x(m.  n)  be  a  sequence  that  has  a  region  of  support 
given  by  R(Af.  Nl  and  assume  that  this  is  the  smallest  rec¬ 
tangle  that  will  enclose  all  the  nonzero  values  of  xlm.n).  It 
follows  from  Eqs.  ( 10)  Ihsl  the  information  necessary  to  derive 
the  initial  conditions  to  begin  the  recursive  phase  retrieval 
algorithm  are  the  values  of  x(m,  n )  along  any  two  contiguous 
edges  of  A(Af,  N).  Note,  however,  thst  the  situations  con¬ 
sidered  in  Fig-  6  provide  the  information  necessary  to  specify 
these  two  edges.  In  particular.  Fig.  6(a)  corresponds  to  the 
caw  in  which  all  the  values  of  x(m,n )  along  two  of  the  edges 
of  its  region  of  support  are  xero  except  for  one  point,  i.e.,  the 
noniero  point  at  (Af .  N).  In  Fig.  6(b),  on  the  other  hand,  the 
values  of  x(m,  n)  along  two  edges  are  known  to  be  zero  except 
for  the  two  point  sources  that  are  assumed  to  have  known 
intensities. 

S.  SUMMARY 

In  this  paper  the  importance  of  the  boundary  values  of  i 
two-dimensional  sequence  in  the  phaw-retrieval  problem  has 
been  investigated.  Specifically,  it  was  shown  thst,  given  the 
boundary  values,  phase  retrieval  becomes  a  linear  problem 
that  is  amenable  to  a  simple  recursive  solution.  Furthermore, 
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although  the  determination  of  the  boundary  value*  from  only 
Fourier -transform  magnitude  information  is,  in  general,  a 
nontrivial  problem,  it  ww  shown  that,  for  regions  of  support 
that  have  certain  geometries,  the  boundary  values  may  easily 
be  found.  Thew  geometries,  in  fact,  represents  generalisa¬ 
tion  of  the  conditions  necewary  for  off-axis  holography.  An 
example  illustrating  the  recursive  phaw-retrieval  algorithm 
was  prewnted,  and  the  issue  of  the  numerical  stability  of  the 
recursion  was  briefly  diseuwed. 
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ABSTRACT 

In  this  paper,  the  'aiportance  of  spectral 
phase  and  magnitude  It  examined  from  a  different 
point  of  view.  In  particular,  an  amplitude  and 
angle  tased  representation  of  spectral 
Information  Is  developed.  With  this  formulation, 
a  causal  finite  length  signal  is  uniquely  defined 
by  Its  spectral  amplitude  or,  to  within  «  scale 
factor,  by  Its  spectral  angle. 


INTRODUCTION 

For  both  continuous-time  and  discrete-time 
signals,  the  magnitude  and  phase  of  the  Fourier 
transform  are.  In  general.  Independent  functions, 
l.e.,  the  signal  cannot  be  recovered  from  know¬ 
ledge  of  either  one  alone.  Ntth  the  appropriate 
a  priori  constraints,  however.  It  Is  possible 
that  either  the  spectral  magnitude  or  the 
spectral  phase  may  uniquely  specify  a  signal. 
For  eaample,  when  a  signal  Is  minimum  phase  or 
mailmum  phase,  tie  log  magnitude  and  phase  are 
related  through  the  Hilbert  transform.  For  dis¬ 
crete-time  sequences.  It  has  also  recently  been 
shown  that  a  finite-length  sequence  Is  uniquely 
specified  to  within  a  scale  factor  by  Its 
spectral  phase  assuming  that  the  sequence 
contains  no  yero  phase  factors  In  the  form  of 
conjugate  reciprocal  teros  [IJ.  Unlike  the  mini¬ 
mus  and  maximum  phase  constraints,  however,  there 
Is  no  dual  statement  of  uniqueness  between  a 
sequence  and  Its  spectral  magnitude  under  the 
same  set  of  conditions.  In  particular,  for  any 
finite  length  sequence  i(n)  another  finite  length 
sequence  with  the  tame  spectral  magnitude  may  be 
easily  created  by  the  we!l-kno<«i  procedure  of 
•aero-f lipping*  (2J. 

In  this  paper,  a  different  representation  of 
spectral  Information  Is  Investigated.  In  parti¬ 
cular,  an  amplitude  and  angle  representation  of 
Fourier  transforms  Is  developed.  Kith  such  a 
representation,  a  causality  constraint  Is  suffi¬ 
cient  for  a  discrete-time  signal  to  be  uniquely 


specified  In  terms  of  Its  spectral  amplitude  or. 
In  most  cases,  to  within  a  scale  factor. oy  Its 
spectral  angle.  Althougn  this  uniqueness  result 
may  be  easily  eatenteo  to  discrete  samples  of 
spectral  angle,  an  aroltrary  finite  collection  of 
spectral  maplltude  samples  Is  not  sufficient  to 
uniquely  define  a  causal  finite  length 
sequence.  Nevertheless,  sets  of  N  spectral  am¬ 
plitude  samples  may  be  found  which  provide  a 
unique  characteriiatlon  of  a  causal  sequence  of 
length  N.  Furthermore,  If  M  Is  large  enough,  the 
spectral  amplitude  of  the  M-po1nt  OFT  of  a  causal 
sequence  of  length  h  Is  sufficient  for  its  unique 
Characteriiatlon. 


SPECTRAL  AMPLITUDE  AND  ANGLE 

Let  >(n)  denote  a  one-dimensional  sequence 
and  X(w)  Its  Fourier  transform.  For  either  real 
or  complen-valued  sequences,  *(w)  Is  generally  a 
complex-valued  function  of  w  which  may  be 
written  In  polar  form  In  terms  of  Its  magnitude 
and  phase  as: 

Urn)  .  |X(w)|exp[Je|(w)]  (I) 

where  the  phase,  e^u)  Is  defined  by 

♦„(•)  •  ten~,llj(*)/lg(w))  12) 

ana  assumes  values  within  the  range  T-v,  «]  . 
Note  that,  In  addition  to  the  value  of  the  ratio 
R(w)*X,(«,VXK{«),  knowledge  of  e,(w)  assumes 
that  the  s‘gn  of  kR(w)  and  the  sigh  of  X,(w)  are 
known  for  each  frequency.  Therefore,  since 

*g(w)  •  |R(w)l  cos  tKM  (3a) 

*,(w)  •  |l(w)|  sin  ♦,(«)  (3b) 

knowledge  of  e  (w)  Implies  that  the  hard-clipped 
versions  of  X.\«)  and  x.(»)  are  known.  It  Is 
the  set  or  *iero-trdtstngs*  of  *.(u)  or 
of  i.(»)  which  provide  a  key  piece  of  information 
about  X(w)  and,  consequently,  about  x(n).  For 
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example,  It  may  be  them  that  |X(w)|  along  with 
the  *iero  crossings-  of  »p(«)  provide  a  unique 
specif teat  ton  of  a  finite  duration  causal 
sequence. 

Instead  of  d* fining  the  phase  of  X(w)  as  In 
(2),  which  presumes  knowledge  of  the  zero  cross¬ 
ings  of  *.(<•)  and  X.(w)  ,  suppose  that  the  phase 
of  X(u)  Is  defined  uy  taking  the  principle  value 
of  the  arctangent  function  In  (2)  so  that  It  Is 
confined  to  the  range  £-v/2,  t/2)  .  furthermore, 
let  ♦,(“)  be  used  to  denote  this  definition  of 
the  phase  of  x(w)  and  let  us  refer 
to  It  as  the  angle  of  X(u).  Dote  that  a  (w)  and 
tanta  («)]  are  equivalent  pieces  of  Information 
about  x(u).  therefore,  let  X(w)  be  written  as 

X(w)*ix(»)l  exp£jax(«)) 

•  |X(u)|  exp  ♦  *,(>■>)]  (4) 

•*„(«)  exp  [a^t-)} 

where 

/  (-l-IX(w)l exp£ ja (<- » j* IX (« ) I sgn[cos  a, (w> ] 

*  (S) 

Is  defined  to  be  the  amplitude  of  X(w).  Note 
that  a  (“)  In  (S)  Is  equal  to  lero  or  «  for 
each  U,  t.e.,  tan  [a  («)]*0  for  all  u  .  Ho  re 
specifically,  a. M«0  whenever  a,(w)  Is  within 
the  Interval  f-v/2,  v/2)  anrf  it  1$  equal 
to  X  Otherwise.  Therefore,  the  amplitude  of 
X(w)  may  equivalently  be  expressed  as 

I X (w J |  If  -t/2<a,(w)<v/2 
*.(»)  *  *  (i) 

-IX(w)l  otherwise 

Thus,  spectral  amplitude  contains  spectral  magni¬ 
tude  Information  along  with  one  bit  of  phase 

Information,  l.e.,  *K(“)  ■ 

Finally  It  should  be  pointed  out  that 
whereas  IX(  w)|  is  a  continuous  function  of  w, 

A  (w)  is  discontinuous  at  tnose  points 
where  e,(»)  passes  through  t  v/2  ,1.e.,  at  those 
frequences  idiere  X-fw)  passes  through  aero. 
Thus,  A  (w)  contains  information  about  both  the 
magnitude  of  the  transform  as  well  as  the  aero 
crossings  of  the  real  part  of  the  transform.  For 
example,  shown  In  Figure  1  Is  the  magnitude, 
phase,  amplitude,  and  angle  of  the  Courier  trans¬ 
form  of  a  discrete  time  signal  of  length  N-4. 
Note  that,  as  defined  In  (5),  the  mtplltude  of 
X(w)  1$  discontinuous  at  those  frequencies 
where  «  (»)  •  t  v/2  and  Is  negative  when  the 
phase  It  outside  the  Interval  £-v/2,  v/2). 


UNIQUENESS  IN  TEWS  OF  AfTlITUOE  INFORMATION 

Although  a  finite  length  sequence  which  has 
no  aero  phase  component  Is  uniquely  defined  to 
within  a  scale  factor  by  Its  spectral  phase. 
Spectral  magnitude  does  not  place  enough 
constraints  on  a  finite  length  sequence  to  Insure 


a  unique  solution.  Specifically,  If  v(n)  It  a 
finite  length  sequence  with  a  t-transform  X(a) 
which  h.ss  zeros  at  a^  ,z,  ,...,z„  then  by  replac¬ 
ing  any  one  or  more  or  these  aeros  with  their 
conjugate  reciprocals,  l.e..  replace  the  aero  at 
t*zk  with  one  at  t*l/zk*  then  the  resulting  se¬ 
quence  will  have  the  same  spectral  magnitude. 
Although  aero  flipping  necessarily  preserves 
spectral  magnitude,  aero  flipping  must  result  In 
a  sequence  with  a  different  spectral  phase.  A 
question  of  Interest,  therefore.  Is  whether  or 
not  aero  flipping  results  In  a  sequence  with  a 
different  spectral  amplitude.  Without  some  addi¬ 
tional  Information  or  constraints,  however,  this 
Is  not  always  the  case.  For  example,  consloer  an 
arbitrary  finite  length  sequence  x(n)  which  has  a 
spectral  amplitude  given  by  A(|  u).  With 
y(n)*x(-n),  note  that  the  spectral  magnitudes  of 
x(nj  and  y(n|  are  the  tame.  In  addition,  the 
spectral  phases  of  x(n)  and  y(n)  are  related  by: 

♦„(-)  *  -*yM  (2) 

Consequently,  It  follows  tnat  the  aero  crossings 
of  the  real  parts  of  tne  transforms  of  x(n)  and 
y{n)  [the  frequencies  for  which  the  spectral 
phase  is  equal  to  t  */2]  are  the  same  and, 
therefore,  that  the  spectral  amplitudes  are 
identical.  Causality,  however,  will  eliminate 
this  mvblguity  ana.  In  fact  Is  a  sufficient  con¬ 
straint  for  a  finite  length  sequence  to  be  uni. 
quely  defined  by  its  spectral  amplitude.  More 
precisely: 

Theorem  1:  If  x(n)  and  y(n)  are  causat 

finite  length  sequences  and  If  A ,(w)*A,(w) 
for  all  w,  then  x(n)»y(n).  J 

Note  that  there  Is  now  a  duality  which  Is  similar 
to  the  duality  found  in  the  uniqueness  of  a  mini¬ 
mum  phase  sequence  In  terms  of  Its  spectral 
magnitude  or  spectral  phase.  Specifically,  from 
Theorem  i  above  and  the  uniqueness  theorems  con¬ 
cerning  signal  reconstruction  from  phase,  the 
following  corollary  Is  now  Immediate: 

Corollary  |:  A  causal  finite  length  se- 
quence  it  uniquely  defined  by  the  amplitude 
of  Its  Fourier  transform  and  to  within  a 
scale  factor  Oy  the  angle  of  Its  Fourier 
transform  If  x|a)  contains  no  conjugate 
reciprocal  zeros. 

Note  that  although  Theorem  1  Is  founded  on  a 
specific  d?f1nit‘on  for  the  arapMtude  of  the 
Fourier  transform  pf  a  discrete-time  signal.  It 
Is  possible  to  adopt  a  more  general  definition. 
Specifically,  note  that  the  spectral  amplitude  it 
defined  In  (j)  to  be  equal  to  Its  spectral  magni¬ 
tude  when  the  phase  Is  within  the  interval 
£-v/2,  t/2)  and  it  Is  defined  to  be  minus  the 
spectral  magnitude  when  the  phase  Is  outside  th's 
Interval.  As  previously  noted,  with  this  defini¬ 
tion  of  spectral  amplitude,  knowledge  of  the 
amplitude  of  the  lourler  transform  of  a  signal  Is 
equivalent  to  knowledge  of  tne  Fourier  transform 
magnitude  along  with  those  frequencies  for  which 


the  ph ase  of  trie  focrler  transform  Is  equal  to 
one  of  too  possible  values,  t  »/2  .  By  choosing 
other  values,  different  definitions  for  the  am- 
pl'tude  may  be  obtained.  For  example,  let  w  be 
an  arbitrary  number  olthln  the  Interval  [•«“  x) 
and  consider  defining  the  amplitude  of  the 
Fourier  transform  of  a  discrete  time  signal  as: 

IX(w)|  If  »<♦(«)<-*. 

A_<«:0  •  0  1  0  Id) 

0  -|>(w)|  otherwise 

In  this  knowledge  of  A  («:«•.)  Is  equ4v«* 

lent  to  knowledge  of  the  mag^tude°of  *(*)  along 
with  the  frequencies  for  which  the  phase  of  k(w) 
is  equal  to  either  •  or  «  ♦*.  fiote  that  al¬ 
though  Theorem  1  °consid?rs  the  case  for 
wh*cn  wQ  •  it  nay  be  shown  to  hole  for  all 

values  of  h»o  except  for  wQ»0  ,  5pecif fcally,  [4j 

Corel  lary  2:  let  »(n)  and  y(n)  be  two 

cauSaT  finite  length  sequences.  If 
)»A  (u:*  )  for  all  w  with  w  #i)  then 

afnWf").*  0  0 


UNIQUENESS  IN  TERMS  OF  AMPLITUDE  SAMPLES 

In  the  previous  section,  some  uniqueness 
results  were  presented  assuming  that  the  spectral 
amplitude  of  a  finite  length  sequence  Is  arewn 
for  all  frequencies  In  the  Interval  [0,2x).  In 
the  case  of  spectral  phase  or  spectral  angle  It 
is  poss'Dle  to  generallae  the  uniqueness  results 
to  the  case  in  which  spectral  phase  or  spectral 
angle  Is  known  only  for  a  finite  number  of  dis¬ 
tinct  frequencies.  Specifically,  It  has  been 
shown  that  for  a  finite  length  sequence  of  length 
h  which  has  no  symmetric  (zero-phase)  factors  In 
Us  z-trjnsform,  any  (As- 1 )  samples  of  either  Its 
spectral  phase  or  spectral  angle  Is  sufficient 
to  uniquely  define  the  sequence  to  within  a  scale 
factor  (1J.  Unfortunate'y.  however,  a  finite  set 
of  amplitude  samples  Is  not  always  sufficient  to 
uniquely  specify  a  causal  finite  length  se¬ 
quence.  For  example,  consider  the  following  two 
ceusal  sequences  of  length  h*3 

«(n)  •  t.Ot(n)  ♦  2.64(n-l)  ♦  l.24(n-2) 

(y) 

y(h)  •  1.24(h)  ♦  2.bi(n-l )  ♦  l.u*(n-2) 

Since  y(n)  Is  obtained  from  x(n)  by  flipping  both 
of  the  zeros  of  Hi)  about  the  unit  circle,  both 
«(n)  and  y(n)  have  the  same  spectral  magnitude. 
Furthermore,  in  the  Interval  (0.  «),  the  real 
part  of  the  Fourier  transform  of  x(n)  1$  equal  to 
zero  at  only  one  frequency,  w  *.477u23  •  and  the 
real  part  of  the  Fourfer  transform  of  y(n)  Is 
equal  to  zero  only  at  w*.S2olbo  v.  Tnerefore, 
the  amplitude  of  *(<•)  Is  equal  to  the  amplitude 
of  Y(w>  for  all  w  outside  the  Intervals 
(. 477023a,  ,S2blb6i )  and  (-.5261ebx.-.47?023i). 
Consequent ly,  an  arbitrary  numoer  of  enplltude 
samples  within  this  region  is  not  sufficient  to 
distinguish  a(n)  from  y(n).  dote,  however,  that 
One  sample  of  the  amolltude  within  the  Interest 


(.477023  •,  .bTblott  •)  Is  sufficient  to  distin¬ 
guish  a(n)  from  y(n). 

Thus,  although  a  given  set  of  samples  will 
not  lead  to  a  unique  specif  teat  ion  of  a  sequence 
In  terms  of  spectral  amplitude  In  all  cases,  it 
may  be  shown  that  a  finite  set  of  samples  may 
always  be  found  which  provide  this  un'que  speci¬ 
fication  (4).  In  particular,  any  causal  finite 
length  sequence  of  length  H  may  be  uniquely  de¬ 
find  by  the  spectral  magnitude  of  Its  M-potnt  OFT 
provided  M  Is  chosen  large  enough. 


RECONSTRUCTION  FROM  AMPLITUDE 

In  this  section,  the  problem  of  reconstruct¬ 
ing  a  finite  length  sequence  from  the  amplitude 
of  Us  Fourier  transform  Is  addressed.  «s  pre¬ 
viously  discussed,  a  given  finite  set  of 
amplitude  samples  is  not  always  sufficient  to 
uniquely  specify  the  sequence,  in  this  section, 
however.  It  will  be  assumed  that  the  unanown 
sequence.  «(n),  is  zero  outside  the  interval 
[D,h-lj  and  that  the  amplitude  of  'ts  M-po‘nt 
OFT,  Aa(t),  Is  known  and  that  N  Is  large  enourn 
to  insure  a  unique  specification  of  »(n). 

Motivated  by  the  Iterative  algorithm 
originally  proposed  by  Gerchberg  rno  Saxton  ibj, 
an  Iterative  procedure  has  been  useo  in  the  re¬ 
construction  from  amplltuoe  problem. 
Specifically,  the  problem  may  be  vleweo  as  one  In 
which  some  signal  constraints  are  known  ootn  In 
the  time  an#  frequency  domains.  In  particular. 
In  the  time  domain  x(n)  is  known  to  have  Its 
support  confined  to  the  Interval  LD.N-1J  and  In 
the  frequency  domain  It  Is  known  to  have  an  M- 
point  DFT  with  amplitude  *,(k).  The  Iteration  Is 
thus  characterized  by  the  repeated  transformation 
between  the  time  and  frequency  dome. ns  where  In 
eacn  domain  and  at  each  step  in  the  iteration  the 
signal  constraints  are  Imposed  on  the  current 
estimate.  The  Incorporation  of  the  time  domain 
constraint  Is  straight-forward  since  It  involves 
limply  a  windowing  operation.  There  are  several 
alternatives,  however,  for  Imposing  the  frequency 
domain  constraint.  Specifically,  with  1  (0 )  the 
estimate  obtained  after  t  Iterations, 

let  a  (k)  be  Its  M-potnt  OFT  which  has  an  ampli¬ 
tude  ‘given  by  A  (k)  .  The  frequency  domain 
constraint  to  be*  placrd  on  X  (k)  Is  f.e  known 
spectral  amplitude  of  a(n),  A  fk)  .  I.,  the  com- 

pttx  plane,  the  OFT  of  a  An)  Is  thus  constrafned 
to  He  On  the  semicircle  fleflneo  by  the  Inter¬ 
section  of  a  circle  of  radius  A  (k)  and  the  half 
plane  of  all  positive  real  parts  If  A  |k)>0  or 
the  half  plane  of  all  negative  real  parts  if 
Aa(k)<0.  Thus,  the  known  amplitude  Imposes  both 
a  magnitude  as  well  as  a  phase  constraint 
on  x,(k)  .  The  fleitblllty  In  Incorporating  the 
amplitude  Information  lies  In  the  methoo  by  which 
the  phase  Information  Is  imposed.  Since  there  1$ 
no  reason  for  altering  the  phase  of  A,|k)  If  <t 
lies  within  the  correct  interval,  ‘the  only 
question  Is  what  phase  should  be  used 
for  X  (k)  when  the  phase  of  xt(k)  falls  outside 
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the  given  Interval.  On*  possibility  It  to  tot 
the  phase  to  xero  If  the  phase  of  X(k)  l«  know, 
to  11*  in  the  Interval  (-v/2.  a/2)  ana  to  <et  H 
equal  to  «  otherwise. 

another  poitlblllty  for  Incorporat ing  th* 
given  amplitude  Information  It  to  tet  the  mapll- 
tutfe  of  Xtt)(k)  equal  to  th*  known  amplitude. 
*»(*»! 

*tt,(k)  •  A^k)  (10) 

with  this  approach,  x  (k)  It  tealed  to  that  It 
hat  the  correct  magnitude  and  then.  If  necestary, 
a  phate  of  *  It  added  to  X,(k)  .  However ,  If  th* 
real  part  of  X(k)  It  clot?  to  ter o  and  the  tlgn 
of  the  real  part  of  X,(k)  dlffert  from  that  of 
X(k).  then  the  Incorporation  of  the  amplitude 
conttralnt  (10)  will  tlqnlf Icantly  Increate  the 
error  between  X(k)  and  X.^tk)  .  Another  potsl- 
blllty,  therefore.  It  ft  limply  teal*  X,(k)  to 
that  It  hat  tne  correct  magnitude  and  tlten  tet 
the  tlgn  of  tne  real  part  of  It-  ,  (k  J  equal  to  the 
tlgn  of  the  real  part  of  X4(k)‘. 

With  either  of  thete  latt  two  approachet  for 
Impotlng  the  frequency  domain  conttralnt,  the 
Iterative  procedure  hat  been  obterved  to 
converge.  In  most  catet.  to  th*  correct  tequence 
when  i(n)  It  uniquely  defined  by  the  amplitude  of 
tti  H-poInt  Of T.  A  theoretical  proof  of  conver¬ 
gence,  however,  hat  not  yet  been  obtained. 
Although  the  number  of  Iteration!  required  to 
reach  a  convergent  tolutlon  It  In  general  very 
large,  thlt  number  tend!  to  decreate  as  the 
length  of  the  OFT  It  Increatcd. 
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This  pipvr  prtMflta  some  details  of  a 
sew  formalise  *Meh  allow*  for  lha  alavl* 
ten mourn  fMcription  and  aanipulat Ions  of 
Moth  tha  artthatUc  and  lepleeentet tonal 
character  lattes  of  Digital  Signal  Procaa* 
aln?  algorlthaa.  This  fornallas  loads  to 
procedures  for  tha  automatic  and  optimal 
implementations  of  a  larya  class  of  algo- 
tithes  bastf  an  bath  3SIM  compilation 
techniques  and  rigorous  systolic  derive— 
tions  as  wall  so  coablnad  approaches* 

Introduction 

Tha  fundaaantal  f^al  of  this  research 
la  to  develop  aethoda  for  tha  automatic  and 
Optisal  realisation  df  a  large  class  of 
Digital  Signal  Processing  <D3P>  algorlthaa 
on  synchronous  multiprocessors  composed  of 
multiple,  identical  prograaaable  proems- 
ears.  This  research  seek's  to  find  tha  aoat 
efficient  poaaiola  •cUtiom,  In  which  the 
intrinsic  synchrony  of  tha  ayatas  saint a ins 
ths  data  pracadenca  relations,  and  in  which 
sa  cycles  of  say  of  the  processors  are  uaad 
far  ayataa  control.  DSP  algorithms,  as  a 
class,  srs  uniquely  wall  aultad  la  this 
approach  both  because  of  their  coaputa- 
tlonal  intensity  and  because  of  their  high 
lava!  of  internal  structure. 

Thle  research  has  both  s  theoretical 
end  on  eaperlsental  coapanent.  In  the  theo¬ 
retical  component.  a  unified  formal  tea  has 
baas  developed  which  allows  for  tha  aleut- 
taneoua  description  and  eanipulation  of 
both  tha  aritheatia  and  laplesantetlonal 
characteristics  of  tha  algorlthaa.  This 
foraalisa  h«'»  bean  uaad  In  turn  to  develop 
eeenlngfwl  dafinltiana  far  optimality  and 
to  develop  algorlthaa  far  the  officiant 
ewtasatle  generations  of  aptlsal  suit  I* 
P' ocaaaar  implementations  ll-d>.  In  the 
eaperlsental  component.  a  synchronous 
aulti-aicioprocesaor  computer  and  operating 
ayoteo  have  bean  developed  (4)  and  e  com¬ 
piler  which  generate*  optisal  suit Iprecee- 
•or  lapleaental lone  for  signal  flow  graphs 
haa  bean  demonstrated.  The  suit  I  processor 
eystoo  la  fundamentally  a  research  tool 
which  ia  invaluable  in  verifying  and  aug- 
•anting  the  theoretical  research. 


in  this  research,  the  algorithsa  to  be 
implemented  era  all  described  using  a  gen¬ 
eralised  flow  graph  representation.  fa  ia 
Illustrated  In  Fig.  I,  a  generalised  flow 
graph  la  a  directed  graph  tn  which  all 
oparstiena  occur  at  tha  nodes,  and  tha 
branches  are  used  exclusively  so  signal 
paths.  Tha  goner si lead  flow  graph  la  a  very 
powerful  representation  which,  if  properly 
applied,  le  not  only  capable  of  describing 
such  traditional  signal  flow  graph  struc- 
turss  so  digital  filters  and  fast  trane- 
fares,  but  also  ouch  nan linear  structures 
as  those  involving  decimation.  interpola- 
tion.  homomorphic  processing,  and  a  large 
class  of  aatrtm  operations.  In  addition, 
by  allowing  the  nodes  to  bo  loo  level  logic 
operations.  generalised  flow  graphs  can 
also  describe  bit-serial.  byte-serial, 
end  aeny  other  distributed  arithmetic 
structures. 

A  fully  mpgpified.  flow  graph  la  a 
generalised  flow  graph  in  which  tha  node 
operations  are  all  fundamental  operations 
Of  the  constituent  processor  on  which  the 
algorithm  will  be  ispleeonted.  The  defini¬ 
tion  of  the  node  operations  In  the  fully 
•pacified  flow  graph  sets  tha  granularity 
with  which  the  parallel  tea  can  be  ex¬ 
ploited.  Penerlc  flow  graphs  are  all 
these  general  lead  flow  graphs  which  do  not 
seat  tha  conditions  to  be  a  fully  specif lad 
fleu  graph.  In  general,  tha  nodes  on  gen¬ 
eric  flow  graphs  contain  sacra  oparstiena 
which  involve  aultlple  aparationa  of  the 
constituent  processor.  Typically,  a  number 
of  different  fully  specified  flow  graphs 
can  be  generated  froa  single  generic  flew 
graph  < see  Fig.  l>. 

rigw  £rgeh_%gvndi 

divan  that  only  ana  pracaeear  type  ia 
to  be  used  in  the  eventual  suit l processor 
Ispiesentst ton  and  given  that  the  charac¬ 
teristics  of  this  constituent  processor  arm 
known,  than  It  la  possible  to  compute 
bounds  an  the  synchronous  suit i processor 
realisation  of  e  fully  specified  flow 
graph.  Two  bounds  era  of  particular  inter¬ 
est.  Tha  first  bound,  called  the  agaplg 
PfCAff*  t£*nA«  Involves  tha  alninsa  saapl.ng 


*-l 


purled  »t  MUick  a  p»rtloiU»  algor  I the  can 
»•  iaplM«nt«tf  MlM  «  particular  ««- 
etltmrnt  processor.  Thu  ••■pic  per  led 
pound  ia  pact  an dvr stood  in  the  content  of 
«  rcour nice  eingle-t iae-inde*  f lev  frapp 
4  each  aa  aa  UR  digital  filter).  although 
the  concept  ia  alao  meaningful  in  systems 
which  have  no  explicit  sampling  period, 
for  auch  ayeteme.  tha  aaapla  period  Pound 
id  given  by 


where  p  varied  over  the  act  off  all  loope  in 
the  floe  graph#  d  ia  the  arlthcetlc  delay 
in  the  loop  p  and  R  ia  the  ntiaber  of  unit 
delay#  nodea  in  loRp  p.  Thie  reauit  ie  e 
penerallsat Ion  of  a  reauit  puPliehed  py 
Renfore  and  Ruevo  «3>. 

The  aecond  beund  of  Interest  le  the 
ff]ft  feevft#..  Thie  ie  the  a  ini  aye  tiee  re¬ 
quired  between  the  availability  of  an  lnpat 
aaapie  and  the  computation  ef  the  cor* 
responding  output  eaaple.  The  delay  Pound 
le  given  Py 

P •  RIR  C  d  I 

R  P 

where  d  ie  the  arithmetic  delay  in  the 
peth  p.  ”end  p  include a  ail  loop  free  paths 
free  tha  input  to  tha  output.  Xn  thie 
computation,  delay  eleaente  era  aaauaad  to 


It  le  important  to  note  that  these 
Pounds  ere  aeeocleted  with  fully  specified 
flow  graphs,  and.  as  ie  Illustrated  in  Pig, 
t#  different  fully  apeclfied  flow  graph 
real i rat Iona  for  the  eaaa  generic  flea 
graph  say  have  quite  different  Pounds. 
Renee#  a  first  step  In  finding  an  optical 
eolation  involved  ehooetng  the  Poet  fully 
epeoifted  flee  grarh  for  the  desired  lapis* 
mentation. 


This  vorh  aahee  use  of  threo  aoparete 
daflnitiona  of  optimality.  In  implementa¬ 
tion  I#  sold  to  he  tft-tgt  gfi&t.ail  if  It 
oehtovee  the  sampling  period  Pound*  An 
I  ■pieman  tat  ion  ie  said  to  be  delay.  gptiapi. 
If  it  achiavee  the  delay  Pound.  An  im¬ 
plement  at  ion  ia  maid  to  be  pfpcgftRC  99%iz 
•Cl  if  It  aahlPlta  perfect  processor  ef¬ 
ficiency  so  that  every  cycle  of  every  pro- 
ceeeor  ia  used  directly  on  the  fundamental 
operations  ef  the  algorithm  end  no  eye lea 
ere  used  for  synchronisation  or  ayataa 
control.  Clearly,  theme  three  definitions 
ef  eptleality  are  non-eacleeive.  and  any 
particular  implementation  may  satisfy  any 
combination  of  those  optimality  criteria. 


inuiUUtttl 

One  particularly  interesting  approach 
to  the  flow  graph  laplaaentatlen  probleo 
involves  the  uae  of  99ICP  < Shewed  Single 
Instruction  Multiple  Petal  realisations  11- 
41.  the  Peafc  3SXNP  concept  in  illustrated 
Py  the  eaaaple  of  fig.  2.  In  9SXRD.  exactly 
the  aaae  prograa  in  eaeouted  on  each  of  the 
processors  in  the  nelti processor  and  that 
prograa  realises  eaectly  on#  tiae-lteratlen 
ef  the  flow  graph.  In  principal,  a  recur¬ 
sive  floe  graph  auch  no  the  third  order 
section  eaaaple  ef  fig.  2  creates  a  recur¬ 
sive  sequence,  rial,  which  ie  required  on 
aucca salve  Iterations  of  the  algoritha. 
Nance,  in  thie  eaaaple.  r<n-3>.  r«n-2> 
end  r(r-l)  are  required  before  rfnl  can  be 
computed.  If  only  a  single  processor  ie 
being  use.  there  le  never  any  date  availa¬ 
bility  issue  ee  tong  as  the  sequences  are 
ceaputed  in  order.  The  hay  point,  however, 
la  that  the  arithmetic  computations  always 
tehe  finite  tin#  and.  for  any  particular 
program  and  constituent  processor,  thaao 
tlaaa  are  well  known.  Nonce,  if.  an  in 
•Keen  in  rig.  2.  two  processor a  onocuting 
the  ammo  prograa  are  used  together,  then 
processor  1  need  not  emit  until  p roc manor  2 
has  completed  the  computation  of  r<n-t)  to 
begin  tho  computation  ef  rial.  Rather, 
processor  1  asy  begin  Its  computation  nt 
the  earl last  time  at  which  it  ie  guaranteed 
that  r(n-ll  will  be  available  Py  the  tiee 
it  ie  needed. 

In  an  33 1  NO  program,  nil  of  the  arlth- 
aetle  operation#  appear  no  nnpllelt  In* 
struct ions,  but  the  delay  nodes  ore  trans¬ 
formed  into  input-output  pairs.  Xn  thin 
way.  the  delay  atructuro  In  the  flow  graph 
Pecoaea  tha  communications  structure  In  the 
3JXN0  real  teat  ion.  In  the  els  processor 
realisation  shown  in  fig.  2  < which  ie  al¬ 
ways  achievable  for  a  third  order  sec¬ 
tion)#  each  processor  operates  on  every 
elnth  tiae  indea.  and  the  processors  are 
shewed  in  tine  by  one  eaaple  period. 

Tor  any  given  program  and  any  given 
constituent  processor,  it  ie  possible  to 
compute  a  sampling  period  beund  for  the 
391RR  realisation  (2-11.  Thie  hound  le 
gives  Py 

3b*RURC  Nj'NjI 

where  e  in  the  delay  between  the  first 
utilisation  of  output  ef  e  daisy  node  end 
the  ties  at  which  the  input  ie  the  delay 
node  to  computed  and  n  la  tha  order  ef  thw 
delay  in  tha  daisy  no&e.  Thin  $$1RD  ppuqd 
in  eaectly  tha  aaaa  Pound  for  programs  that 
the  sampling  period  Pound  In  for  fully 
apeclfied  flow  eraphe.  Nance,  if  a  prograa 
eon  be  generated  auch  that  the  33IND  Pound 
ie  eguel  to  the  saeple  period  Pound,  than 
the  331*0  realization  in  rate-optiaal. 
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LUmIm,  Ifc*  aMtuii  Mrtktt  W  pr*MMor«  IM  pro«rtu  <m  Um  c«MUtM«t  ptocM- 
which  own  N  um4  U  U  Ml  MINO  MTV. 

raiUutiM  it  lift*  hy 

Tm  Utoitmi  toiaU  ahMlt  tt  tttt 
■  •  fT/|  1  ttwctrtitf  tlii  931*9  compiler.  Hrit.  «t 

previously  nwntionwd.  itt  •ttlleallM  to  bp 
whoro  t  la  tM  total  Mratltt  of  all  tho  aa  eeane  ilaltat  to  the  leboritory  swlti- 

oparatieaa  in  tha  proprea  lor.  equivalent-  processor  araunt  which  It  waa  tavalopad. 

ly.  tha  flaw  praph)  end  t- 1  la  tha  an*  it  can  aalta  aaallp  ba  wwwa  la  tap-do** 

celling  funetloa.  dwelyn  apataaa  vamy  oicreproceaoero*  slp- 

nal  procseelap  chip**  ar  VLSI  realisations* 
Tha  391*8  approach  to  flow  praph  real-  Second.  and  aora  ieportant.  la  tha  result 

isatiaaa  la  vary  eltrective  for  weny  ree-  that  if  a  rata-optiaal  991*9  solution  es- 

•ora,  first*  far  alt  391*9  realisations  In  lata*  it  is  vary  aiopla  to  find.  Stated 

which  tha  awabar  af  processors  is  taaa  another  way*  tha  infaraattan  available  froo 

thaa  I  ,  tha  isplaaontatlana  ara  pracaaaar-  tha  coaputation  af  tha  flea  praph  bounds 

eptiael  and  tha  use  af  *  processors  always  defines  aa  praciaaly  tha  character  of  a 

Increases  tha  threuph-pwt  hy  a  factor  af  rete-eptias!  solution  that  it  la  vary  ele- 

sasctly  *  (relative  to  a  ainple  pracaaaar  pi#  to  teat  whether  an  optical  991*9  aolu- 

ftnplooantatlon: .  Second*  whan  tha  S91RD-  tlon  aaiata  and  ta  find  it  if  It  dens*  Xn 

hound  in  equal  ta  tha  aawpla  period  bound*  cant r sat*  findinp  tha  boat  sub-epttaal 

an  In  the  case  far  tha  severity  of  recur-  solution  is  such  aora  rawpwtstlanal ly  in- 

niva  dlpital  filter  structures,  than  thare  tense.  Hanca  ae  have  tha  paradea  that  the 

aaiata  no  suit 1 processor  aalutlan  usiap  tha  aaat  desirable  opt last  solutions  ara  tha 

sea#  constituent  processor  which  is  faster  easiest  ta  find*  but  they  say  not  always 

or  earn  efficient.  Third*  altheuph  tha  salat* 

aespiw- par  lad- hound  concept  la  not  in- 

waleod*  991*9  realisations  ward  equal ip  *lporoue _3yntoI lc  Pur  lest long 

wall  far  sen-recursive  structures*  finally*  \ 

and  east  important,  tha  all-iapertant  cea-  39ZN9  rapreaanta  ana  poaalbla  Mphly 

Busiest ioas  architecture  far  tha  final  const sained  approach  ta  synchronous  eultl- 

laplweentetjea  la  caaplatalp  specified  by  pracaaaar  iapirarntationa.  Another  Mphly 

tha  delay  node  structure  af  tha  flow  praph.  constrained  approach  la  that  defined  by 

tn  particular*  bp  conatrainisp  all  tha  systolic  arrays  *?>.  In  M»e  recant  peat,  a 

delay  nodes  to  ba  first  order*  all  elnpt«-  larpa  nuabwr  af  apatalic  alparfcthaa  haws 

tiaa-lndaa  l 1-di  sens  i  anal  I  991*9  solutions  appeared  in  tha  11  tar  stars,  far  tha  ml 

caa  ba  realised,  with  n  neareet-neiphbor  part*  these  slyer It has  heee  net  been  de- 

wM directional  rlnp  fa  eintler  result  ap-  rived  ar  proved  in  any  ferns!  wap  but  hew« 

plian  ta  2-dlewneional  flew  praphei.  haw-  alaplp  baas  -presented”  without  foraal 

ever.  If  aero  ooepla*  oaanunlcationa  aro  verification  or  proof*  One  of  tha  results 

available*  than  tha  flaw  praph  can  ba  da-  af  tha  application  af  our  foraallea  to 

flood  to  taho  advantapa  of  it  «eeo  ftp*  2>.  systolic  opataao  has  boon  tha  developnent 

of  o .oat  af  * i porous  rules  for  tho  derlre- 
tlon  of  apotolic  isplaoentstleoo  froo  flow 
prapha. 

•a  apt leal  991*9  compiler  far  felly 

spar  if  rod  alpnal  flaw  prapha  has  bean  da-  Two  alnpls-t  lae-lndea  apataaa  ara  said 

ralopod  far  our  labor itery  awltlpracaaaor  to  ba  a«apnt|p||p  apu|%p|epi  u  piran  tha 

donfuter  .  A  blech  dtepraa  far  tha  owlti-  aaaa  inpwt  sequences  thoy  always  pivo  tho 

yroevtaar  coopllor  Is  shows  in  fly*  9*  In  aaaa  output  sequences*  Tha  oyotollo  derive— 

bho  first  step*  tha  eenpl  I  ftp -par  lad-bound  tlon  procedure  is  based  cn  two  thoereoo 

sad  delay-bound  aro  coeputod  for  tho  alp-  conceralnp  tho  ooooatlol  equivalence  of 

net  flow  praph  for  tho  dwotred  constituent  eyatees  doacribad  by  flaw  prapha* 

processor  •  Altheuph  tha  actual  real last loan 

sere  all  bean  tasted  welnp  the  L9I-11  can-  TUt.  ®*T9  . I»T C*U*Yt . . THrpAf*  A  eat  af  N 

etiluent  processor  a#  tho  aultlpraeaaaor  identical  shift  invariant  apataaa  oparatinp 

ceipatyr •  tbo  compiler  o an  bo  alaply  can-  an  *  separata  data  atraaaa  is  essentially 

fipurad  to  coapiie  for  any  appraprlato  equivalent  to  a  ainplo  ay a taw  far  which  tho 

frocraaar.  In  the  ascend  step*  the  inferee-  *  aele  af  S sputa  and  outputs  hare  bean 

tie*  derived  in  the  flrnt  step  la  used  to  inter loovod  aa  aa  ardored  sot  and  tho  ordor 

do  a  Mfhly  pruned  tree- search  te  fine  o  ef  ell  the  delay  nodes  in  the  flow  praph 

reto-eptlaal  991*9  solution,  if  it  aaiata*  has  bean  aultipliod  by  N. 

If  •  rate-apt  leal  solution  is  not  found* 

than  n  save  we  tensive  tiwe-eeeich  is  par-  C9*0ViAR?t  A  shift  Invariant  eyeteo  la 
bo  find  tha  bant  (lowest  991*9-  always  essentially  equivalent  ta  a  shift 

bowndl  solution.  These  991*9  aa let ions  can-  invariant  ayataa  whore  the  iopot  has  boon 

nint  of  lapol  order Inpo  af  the  flew  praph  up-aaapled  by  N*  the  output  has  bean  dewn- 

operatlene.  These  aro  wood  aa  Inpot  to  tho  oeaplod  by  *.  and  tho  order  of  all  tbw 

final  onde-ponorotion  stop  which  eonatroeto  delay  nedoe  has  bass  aultipliod  by  R. 
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A  ■***!  C*tMi  !•  ••  that  set 
•f  branch ••  which  are  out  whan  a  clone* 
surface  in  constructed  inside  n  flow  graph 
in  nucb  a  way  that  it  ban***  throve*  w 
nodes. 

TH1  CUT3CT  OCC.AT  TM«SF0»JUT1OM_  TNEOftfft 
Any  shift  invariant  flow  graph  la  es¬ 
sentially  equivalent  to  a  flow  graph  which 
ia  formed  by  »Mlnf  ideal  delay  (advance! 
nodes  to  ell  the  input  branchee  la  a  nodal 
cutset  and  addin*  ideal  advanoe  (delay) 
node a  to  all  thn  output  branchaa  in  the 
aaae  nodal  cutset. 

Thn  application  of  theoe  thaoraaa  la  11- 
luatratad  In  ll|.  4* 

Tha  way  in  which  these  two  theoraae 
can  be  weed  to  derive  eyetoiic  algorithms 
froa  fully  specified  flow  graphs  la  illve- 
crated  in  Flf.'i  6*7.  a  fundamental  con* 
•tralnt  placed  on  ayatolic  arraya  In  thair 
definition  la  tnat  the  transfer  of  dete 
between  cell*  nuet  be  synchros* *ed.  Thin 
transintea  into  a  flow  fraph  constraint 
that  every  output  branch  froa  a  call  aunt 
be  terminated  l>y  e  delay  nods  (pipeline 
reg later >*  Here#,  the  feneration  of  eye¬ 
toiic  solutions  for  flow  graphs  reduces  to 
distributing  the  delsy  nodes  throughout  the 
flow  graph  eo  that  this  condition  ta  net. 
In  thia  procedure,  tha  sample  pariod  bound 
for  otetic  pipelines  (6)  la  used  to  rteter- 
blno  vnern  the  delay  nodes  ohould  be  redis¬ 
tributed,  the  required  inter leaving  factor* 
end  the  appropriate  nodal  cutsets* 

fif.  9  illustrates  a  ayatolio  derlve- 
tton  for  an  riR  filter.  In  the  ewsaple 
shown,  a  proceeeor-opt laal  implementation 
is  always  attainable  at  a  sampling  period 
of  2  awltiply  ♦  1  add  tlae.  and  a  possibly 
non-proceeaor-optieel  iapleaantal  ton  la 
attainable  at  the  aemieun  of  awltiply  or 
one  add  tlae.  Clearly.  If  the  artthestlc 
operations  thenswlves  wore  plpelti.ed.  as 
eight  bo  the  case  for  •  low-level  elcro* 
coded  processor,  shorter  aeapllnf  periods 
are  poeelble* 

flf.  *  illuetretee  a  ayatolic  dories- 
tlon  for  an  lift  filter.  It  should  bs  clear 
froa  this  anaapie  that  an  up-sampling 
tranaf nraot ion  aunt  be  applied  to  fenerate 
any  systolic  solution  for  •  recursive  eye- 
tee.  This  system  operates  on  one  date 
atreee  at  twice  the  nesple*perlod  bound  and 
with  fifty  percant  processor  afflctency.  tf 
a  second  dete  atrsse  were  available*  then 
It  should  be  clear  froa  tha  data  interlasvs 
theorem  that  it  could  also  bo  processed 
eisul tsneeusly  result  in*  in  a  procnenor- 
optiacl  implementation,  but  still  at  twice 
the  aespla-perlod  bound* 


tlon,  nawely  e  triangular  ayatea  solver. 
The  point  of  tbte  eaaapie  ta  to  illustrate 
the  use  of  the  ayatolic  der&vatioa  proce- 
duraa  on  a  oiaplo  ayataa  which  ia  not  a 
digital  filter,  fwndaeentol ly*  this  deriva¬ 
tion  la  vary  stellar  to  that  of  tha  rscur- 
sfva  filter*  and  it  u  ample  to  underatand 
tha  required  interleaving  of  the  date. 

CcUPfJ-JCltrllaft_2?l  wi  ipa« 

SilW  and  ayatolic  arrays  arm  two 
eatreaa  approoebeo  to  sol v in*  th«  synchro¬ 
nous  avltiproeaasor  iaplaaentatlon  problaa. 
The  fundamental  difference  in  thee#  two 
epproechee  le  elnple  to  under stand.  In  the 
99IHP  approach,  the  e  Iyer  i  tha  is  first 
fully  distributed  in  tlae*  sines  a  separate 
tlae  indea  le  assigned  to  each  processor. 
Then  the  implementation  empltsitely  aapn 
thin  tins  distribution  into  apace  <aee  Fig. 
2> .  2n  the  systolic  approach,  the  elgor- 
Ithaa  ore  alweye  distributed  directly  in 
apace.  Clearly,  fly. ‘a  9-7  illustrate  thet 
e  ayetoiic  derieetion  can  bo  considered  ee 
a  direct  partitioning  of  e  flow  graph  naong 
several  processors. 

bhan  compared  directly  to  systolic 
array  solutions*  39IMD  hat  many  attractive 
features,  unerase  ayatelic  arrays  can  sal- 
don  achieve  the  eeaple  period  bound  end  ere 
only  proceeeor-epimsl  for  apeciat  cases* 
391  HD  la  almost  always  processor-optimal 
and  also  is  often  rate-optimal  as  well, 
■either  technique  le  often  delay-optical. 
931)19  yolns  its  advantage  from  the  feet 
that  by  viewing  the  preblea  tn  the  line 
reference  of  the  individual  processors 
rethwr  then  the  reference  of  the  eyatea 
clock,  a  laaanaoly  com pi am  timing  promise 
ia  transformed  Into  a  relatively  staple 
timing  problaa. 

tut  to  eeko  comparison#  of  this  sort 
la  to  alas  the  aoet  fundamental  point.  That 
is  that  $h9»«  jwq  tntww  (n  tf  ih: 
ttMtlcgllr  VfftftDfd-  ft**  the  point  of 
view  of  33IM0  derivations*  the  problem  to 
that  If  no  reto-optlea!  solution  salats, 
then  the  beat  auto-optimal  solutions  aro 
difficult  to  find,  however .  If  the  opera¬ 
tions  ot  a  single  tlae  indea  can  be  distri¬ 
buted  screes  sever el  processor e*  then  opti¬ 
mal  solutions  (rste-opt lost  and  daiay- 
optiaa\»  always  salat  and  are  (relatively! 
alapie  to  find.  The  oddity  here  is  thet  If 
only  oot laal  aolutlone  are  sought*  then 
fewer  operations  ere  required  to  find  thee 
than  if  sub-optimal  solutions  ere  required. 
This  approach,  wh.ch  has  been  celled  F331MD 
(Parallel  Skewed  Single  Instruction  Multi¬ 
ple  Data)  In  the  peat*  suffers  froa  the 
problem  that  the  communications  archi¬ 
tecture  le  not  eo  oeetly  controlled  ee  le 
the  SS2MD  case. 
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Pig.  7  illustrates  tha  derivation  of  e 
eyetolie  implementation  for  a  aetrin  epere- 


There  le  clearly  so  such  prebtee  le 
the  ayatelic  derivations.  Indeed,  the  oes- 


ounlcationa  ava»*c««ti»aln  ayatalla  attua- 
tur««.  mv  proklM  with  ayataiio  «r«y»  id 
th«i  tK«T  K%««  boon  artificially 
atral n«d  in  rvquirlnf  that  tha  cnmniei- 
tion*  ba  dona  in  aynchrony.  T»*id  aaplicitiy 
4italiov*  tha  Hn^ilUr  inK^rant  in  »IM- 
aka*ln«  Roaovinf  thia  raatnctlon  «Uo«( 
for  tha  dlatrlbvtion-in-tiva  tachn*quaa 
••plorH  in  33IRD  to  bo  coabinttf  with  tho 
ayatolic  darlvatlona  for  noro  officiant  ami 
hifhor  spsad  iaplvaantotiona.  Flf.  0  illud- 
qra too  thia  principal  for  tho  ataplo  caao 
c /  a  third  ©rdar  aaction. 

r If .  H  aho«o  two  apaca~Ctaa  aa- 
alfnaanta  of  ats  proeaaaor  iaploaontat Iona 
of  a  third  orOor  moot  Ion*  Tha  raalirataon 
•hown  in  Flf.  da  ia  prccaaaor~opti»al* 
daisy-optimal.  and  rsta-optlaal.  Tha  altar* 
nato  raaiisatlon  ahoon  in  Fly.  Ob  ia  pro* 
caaaor-opt iaa l ,  t ato-optlaal.  and  roquiraO 
only  naaroat  naifhbor  coaaumcatlona.  both 
tha  roaiiaatlona  of  Fif.  0  and  tho  93ZHB 
roalftsationa  of  Fif.  2  aro  oaaapiaa  of 
oyelo-atatio  pipolino  ooiatlona. 

Tho  concapt  which  anifiaa  thaaa  too 
dfaaiailar  approach*#  ia  tho  tachniaoo 
known  in  difitai  filter  thaory  aa 
-Mockinf fhon  a  /loo  yraph  la  "Plochod-* 
tha  baaio  /loo  «raph  ia  aodlfiad  ao  aa  to 
procoaa  block#  of  fata  rathar  than  oinflO 
data  potato.  Tho  aotlvottoa  for  block 
/tltoro  in  owr  contaat  la  quit#  dlaalailar 
froa  othor  application#.  In  particular* 
blocking  to  a  oay  of  aaplicitly  includinf 
operation*  froa  dlfforant  tiaa  indicia#  in 
tha  aaaa  fully  apoc&fiad  flow  yraph.  fhan 
flltara  ora  b lac had  in  thia  oay  and  tiaa* 
okaood  dalaya  ara  allaoad*  than  331  HD  and 
at  bar  cyeio-atatlc  aoivtiona  ara  available 
fro*  ayatolle  dart vat ion  tachniqvaa  and 
vica  vara a. 
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Rigoco-je  coupM-weve  tmory  of  diffraction  by  dielectric  gretinge  ie  eyUnded  to  cover  E-mode  poieriietion  end 
laaeee.  Unlike  in  the  M-neode-poleriiekioo  ceee.KU  shown  that,  is  the  E'-modt  ceee,  direct  coupling  exieu  between 
all  diffracted  ardent  rather  then  Juet  between  adjacent  erdm. 


INTRODUCTION 

Optical  diffraction  by  dielectric  gretinge  hae  been  the  eubject 
of  extensive,  sustained  research  for  many  year*.  Fields  of 
application  include  aooustonpticx,  integrated  optics,  quni.Uuo 
electronics,  holography,  and  spectroscopy.  Grating-device 
functions  indude  laser-beam  deflection,  modulation,  coupling, 
filtering,  distributed  feedback,  distributed  Bragg  reflection, 
holographic  beam  combining,  wavelength  multiplexing, 
wavelength  demultiplexing,  and  other*. 

A  rigorous  coupled-wave  theory  (without  approximations) 
has  recently  been  formulated  for  dielectric  gratings.1  This 
analysis  applies  for  incident  light  of  H-mode  polarization 
(electric  field  perpendicular  to  the  plane  of  incidence  and 
perpendicular  to  the  grating  vector).  !t  is  the  purpose  of  this 
paper  ( 1 )  to  show  how  the  rigorous  coupled  -  wave  analysis  can 
ha  extended  to  treat  E-mode  polarization  (electric  field  in  the 
plane  of  incidence  and  in  the  plane  of  the  grating  vector)  and 
lossy  grating*.  (2)  to  show  that  coupling  exists  between  all 
diffracted  orders  for  E-mode  polarization  (unlike  the  case  for 
If -mode  polarization  in  which  the  coupling  is  only  between 
adjacent  orders),  and  (3)  to  compare  rigorous  E-mode  results 
for  gratings  with  and  without  losses  with  previous  approxi¬ 
mate  E-mode  results,  rigorous  E-mode  results,  and  approx- 
imat*  E-mode  results. 

GRATING  WAVE  EQUATIONS 
General  Vector  Wave  Equations 

The  lossy  dielectric  grating  is  characterised  by  a  relative 
permittivity  that  is  periodic  and  is  given  by 

t(x.x)w(,-f  <,  co*|K(x  sin  «  +  r  cos  «)|,  (1) 

where  J0  is  the  average  complex  relative  permittivity  given 
by 

Jo-«o-|W«c*.  (2) 

tf  is  the  average  relative  permittivity,  «o  is  the  average  con¬ 
ductivity  (representing  the  nonspatially  varying  losses),  u  is 
the  optical  radian  frequency,  to  is  the  permittivity  of  free 
•pace,  t|  is  the  amplitude  of  the  sinusoidal  relative  permit¬ 
tivity,  *  is  the  grating  slant  angle.  K  it  the  magnitude  of  the 
grating  vector  given  by  K  ■  2»/A,  and  A  ia  tha  grating  period. 


The  planar  boundaries  of  the  grating  are  perpendicular  to  the 
t  direction atz  “ Oand r  «d.  Although Eq.(l) represents 
the  particular  case  of  a  sinusoidal  permittivity,  other  grating 
profiles  can  also  be  treated.  The  electromagnetic  fields  inside 
a  planar  lossy  dielectric  grating  with  a  spatially  varying  rela¬ 
tive  permittivity  are  given  by  vector  wave  equations  obtained 
directly  from  Maxwell's  equations.  The  electric-field  vector 
wave  equation  is 

v*E  +  v|e*^J  +  *»<(*.*) E  -  0.  (3) 

where  E  is  the  electric  field,  r(x,  z)  is  the  periodic  complex 
relative  permittivity  (dielectric  constant)  k  “  2»/A,  and  A  it 
the  free-tpace  wavelength.  Similarly,  the  magnetic-field 
vector  wave  equation  ia 

V*H  +  —  XVxB  +  IMr.ilff-fl,  (4) 

t 

where  His  the  magnetic  field.  These  general  wave  equations 
may  be  considerably  simplified  for  particular  incident  wave 
polarizations. 

ff-Mode-Polartzation  Wave  Equation 
For  E-mode  polarization  (electric  field  perpendicular  to  plane 
of  incidence  and  perpendicular  to  the  grating  vector),  the 
electric  field  ia  solely  in  the  y  direction,  and  so  E  ■  Ej,  where 
$  is  the  unit  vector  in  the  y  direction.  Because  the  electric 
field  is  perpendicular  to  the  grating  modulation  vector,  then 
E  •  Vs  -  0.  Electric-field  vector  wave  Bq.  (3)  therefore  re¬ 
duces  to  the  scalar  Helmholtz  wave  equation 

V*E  +  kVfz,  »)E  »  0.  (M 

This  ia  the  equation  that  ia  commonly  solved  in  the  analysis 
of  dielectric  grating  diffraction. 

E-Modo-PoUrixalton  Wave  Equation 
For  E-mode  polarization,  the  electric  field  ia  in  the  plane  of 
incidence,  and  this  plane  contains  tha  grating  vector.  The 
magnetic  field  ia  solely  in  the  y  direction,  and  so  R  *  Hi. 
Because  the  magnetic  field  is  only  in  the  y  direction,  it  is  ad¬ 
vantageous  to  select  and  to  work  with  tha  magnetic-field 
vector  wave  Eq.  (4).  This  vector  wave  equation  may  be  sim¬ 
plified  by  using  the  vector  identities  v<  x  v  x  v  (v«-J7) 
-  (vr  •  v)77  -  (R •  v)v«  -  R  x  (v  x  Vr)  and  v  x  v<  ■  o. 
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For  f'laodt  polarization,  H  is  perpendicular  to  Wr.  and  thua 
Vi'R  -  Oand  (77-  V)V«  «  0.  The  inagnstic-ficid  vector  wave 
equation  thua  reduces  to 


'-(?•+ 


+  k*r(x.x)//-0. 


This  equation  contains  an  additional  tens  in  comparison  with 
£-mode  wave  Eq.  (S). 


term  into  standard  form,  the  leading.factor  is  expanded  in  a 
Fourier  series  as 


-  K7g->— t  da  txpUhll-f),  (11) 

»*+  r,cos(K-P)  a— 

where  da  ■  -  t[<«o/«i)*  -  l]l/J  -  (fo/eiM*  (or  ki  1,  d-a  * 
-da,  and  d0  “0.  For  the  £ -mode  case,  the  magnetic  field 
is  expanded  in  space  harmonics  as 


COUPLED-WAVE  EQUATIONS 

//-Mode  Coupled-Wave  Equations 

The  //-mode  polarization  coupled-wave  equations  may  be 

obtained  by  expanding  the  electric  field  in  space  harmonics 


£<*,  *)--*  £  S,(r)exp(-;o.-r),  (7) 

where  i  is  the  integer  space-harmonic  index,  S,(z)  is  the 
space-harmonic  electric-field  amplitude,  a,  ■  Jf j  -  iK  from 
the  Floquet  theorem,  and  is  the  wave  vector  of  the  zero- 
order  (i  »  0)  refracted  wave  in  region  2,  the  grating  region  0 
£x  Sd.  (Region  1  is  the  input  region  z  £  0,  and  region  3  is 
the  output  region  z  it  d.)  The  magnitude  of  is  *2  « 
2x(fo)' '  VA.  Each  spare  harmonic  -S,(z)  inside  the  grating  is 
phase  matched  to  a  forward-diffracted  and  a  backward-dif¬ 
fracted  wave.  These  waves  may  be  either  propagating  or 
evanescent.  Substitution  of  Eq.  (7)  into  Eq.  (5)  leads  to  an 
infinite  exponential  series  in  terms  of  S,  (z ).  Each  coefficient 
may  be  expressed  as  a  function  of  i  and  z,  and  each  exponent 
as  a  function  of  i  and  x.  For  nontrivial  solutions,  each  coef¬ 
ficient  must  be  equal  to  zero.  This  gives  the  coupled-wave 
equations.  After  simplification,  the  //-mode  coupled-wave 
equations  are 

1  d-S,(r)  .  2|(io-r;ain/Ol  'a  rcos»ldS,(z) 

2a1  dz2  1  x  |  A  A  ]  dz 

+  S.U )  +  [S,+i(z|  +  S,-i(z)|  -  0.  (8) 

where  S'  is  the  angle  of  incidence  in  region  1  of  the  input  plane 
wave,  «r  i*  the  average  relative  permittivity  in  region  I,  and 
m  is  defined  as 

m  a  2(A/A)|</,/2  sin  <p  sin  S'  +  (<0  -  <i  sin2  S')1' 3  cos  d]. 


When  the  real  part  of  m  is  an  integer,  this  represents  a  Bragg 
condition.  These  rigorous  coupled -wave  equations  may  be 
solved  by  the  state  variable  methods.-  and,  together  with  the 
appropriate  boundary  conditions,  all  the  diffracted  fields  may 
be  determined.1 

F-Mode  Coupled-Wave  Equations 

The  vectorial  £-mode  wave  Eq.  (61  can  also  be  reformulated 
as  a  set  of  sc* Ur  coupled-wave  equations.  The  vector  term 
may  be  expanded  as 

/▼«  fisin(ff.P)  I.  d H  SH\2rr 

t «  /  *o  + « i  cos(ff -  Pit  ix  ill  A 


«(z,r).  E  UWrxpl-^.P),  (12) 

where  U ,(z)  is  the  apace-Sarmonic  magnetic-field  amplitude 
and  the  other  quantities  are  defined  as  before.  Substituting 
Kqs.  (10M12)  into  Eq.  (6)  and  proceeding  as  before  gives  the 
£ -mode  coupled-wave  equations  as 

1  d*t/,(z|  .  2  [(«o - </ sin F)l/a  rcosdldWz) 

2r*  dz*  “iw|  A  A  J  dz 

.coed..  .  d U.-kM  .  2 Um-i)...  . 

+  ~UM 

+ j;  It'.*  ,(z>  +  i/..,(z)i + ~  e  |i  -  h  -  |J 

xA*l/,.*(z)-0.  (13) 

These  equations  for  £-mode  polarization  are  clearly  more 
complicated  than  //-mode  coupled-wave  Eqs.  (8).  The  two 
additional  terms  in  Eq.  (13)  both  contain  a  aeries  in  Ax  snd 
U,-h-  Whereas  the  H -mode  equations  contain  only  S, _  i ,  S, , 
and  amplitude  terms,  the  £-mode  equations  contain 
U,~  i,  U„  t/,*i,  and  (/,_»  amplitude  terms.  Therefore  in  the 
H-mode  case  there  it  direct  coupling  only  between  adjacent 
orders,  but  in  the  £-mode  rase  there  is  direct  coupling  among 
all  diffracted  orders.  Although  the  number  of  terms  in  the 
£-mode  case  is  larger,  the  resulting  coupled-wave  equations 
may  be  solved  by  the  state-variables  method  in  exactly  the 
same  manner  as  in  the  H -mode  case. 


BOUNDARY  CONDITIONS 

At  the  boundaries  of  the  grating  (z  •  0  and  z  •*  d),  the  tan¬ 
gential  components  of  the  electric  field  and  the  magnetic  field 
must  be  continuous.  In  this  way,  tne  field  of  eaeh  diffracted 
order  outside  the  grating  volume  is  related  to  the  corre¬ 
sponding  space-harmonic  field  inside  the  grating.  Thus,  in 
order  to  construct  the  boundary  conditions,  the  tangential 
components  of  £  and  H  must  be  determined. 

//-Mode-Polarization  Tangential  Ffeldx 
For  //-mode  polarization,  the  tangential  component  of  the 
electric  field  is  the  y  component  of  £,  and  it  is  given  by  Eq.  (7) 
directly.  The  values  of  S,  (0)  and  S,(d)  needed  in  Eq.  (7)  are 
obtaine  i  by  solving //-mode  coupled-wave  Eq.  (8)  for  S,(z|. 
The  tangential  component  of  the  magnetic  field  is  Ih-  x 
component  of  H.  It  may  be  obtained  from  the  Maxwell  curl 
equation  v  X  E  *  -dB/St.  The  result  is  H,  «  (— >/ 
wp)d£«/dt,  and,  together  with  Eq.  (7),  the  tangential  magnetic 
Field  is 


and  thus  ■  mly  a  y -component  equation  exists.  To  put  this 


H,  ■  (->/wji)  —  £  S,(x) ezp(->o, -PI. 
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Pig.  i.  Diffraction  effete  nor*  of  forward-diffracted  wavw*  tor  a 
louiesi  ♦  -  120*  ilantad  (rating  (120*  angle  from  r  alia  to  grating 
vector)  for  both  //mode  and  E-mode  poUriiationa.  The  average 
permittivity  inside  and  outside  the  grating  a  the  name  (r  -  2.251  The 
angle  of  incidence  P  -  42*  i»  at  the  first  Bragg  angle  (m  •  I).  For 
these  conditions  the  t  -  - 1  Held  is  evanescent  (cut  off).  The  modu¬ 
lation  ia  r |/ro  w  0. 1 20.  The  diffraction  efficiencies  for  all  diffracted 
waves  not  shown  are  less  than  0.01.  (a)  Rigorously  calculated  results. 
The  fields  i  ■  -4  to  i  •  +5  were  retained  to  achieve  convergence  in 
field  amplitudes,  (b)  Multiwave  first -order  coupled-wave  theory 
results,  showing  the  effect  of  neglecting  second  derivatives  and 
boundary  effects.  Notice  that  the  diffraction  efficiency  of  the  i  - -I 
field  is  predicted  to  be  as  large  aa  9%  even  though  this  wave,  in  fact, 
is  evanescent!  The  fields  i  •  -4  to  +S  were  retained  to  achieve 
convergence  in  the  field  amplitudes.'  (c)  Two-wave  (i  ■  0,  +1  i  vec- 
ond -order  coupled-wave  theory  results  showing  the  effect  of  neglecting 
higher-order  waves.  ' 


£-Mode-Polarlzatlon  Tangential  Fields 
For  E  mode  polarization,  the  tangential  component  of  the 
magnetic  field  is  the  y  component  of  H,  and  it  is  given  by  Eq. 
(12)  directly.  The  values  of  17,(0)  and  17,  Wl  to  be  used  in  Eq. 
(12)  are  obtained  by  solving  £-mnde  coupled-wave  Eq.  (13) 
for  17, (r ).  The  tangential  electric  field  is  the  z  component  of 
E.  It  may  be  obtained  from  the  other  Maxwell  curi  equation 
▼  X  H  -  ST}/dt.  The  result  is  E ,  ■  \jlwtatU,  z)|dW,/dr. 
Expanding  l/r(x,  z)  into  a  Fourier  series  gives 


^7)"Jt£.C**llp0hK-F),  (15) 

where  G*  -  |((eo/r,)1  -  l|t«  -  (<„/<, )|IM/(<07  -  <,7)177. 
Substituting  this  into  the  above  equation  for  E,  and  using  H, 
as  given  by  Eq.  (12)  yields  the  tangential  tlectfic  field  as 


E,  “  O'/wro)  £  ezp(-;'ff,  •  F) 


*14.  C*  |^dT^  ^7<5.-h'<)t/.-»(7)J.  U6> 


DISCUSSION 

The  rigorous  scalar  coupled-wave  equations  describing  dif¬ 
fraction  by  planar  lossy  dielectric  gratings  have  been  pre¬ 
sented  for  both  H-mode  end  E-mode  polarizations  as  derived 
from  the  general  vector  wave  equations.  The  resulting  cou¬ 
pled-wave  equations  for  both  cases  can  lie  solved  in  the  same 
manner  by  using  state-variable  methods.2  By  using  these  in 
combination,  any  arbitrary  input  polarization  may  thus  be 
treated. 
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Fif.  2.  Diffraction  efficiencies  of  forward-diffracted  and  bark 
ward -diffracted  warn  for  a  lossless  4  »  150*  slanted  grating  for  bolt, 
H- mod*  and  £-modr  polarization.  The  average  permittivity  inaidc 
and  outaide  the  (ratine  a  the  tune  (r  -  2.251.  The  angle  of  incidence 
F  »  20'  ia  at  the  Hrat  Bran  angle  (m  »  1 ).  The  modulation  ia  «i/«o 
■  0.330,  The  diffraction  efficienciee  for  all  diffracted  waves  not 
shown  are  lew  than  0.01.  fa)  Rigorously  calculated  reaulta.  The 
fields  i  ■  -4-+5  were  retained  to  achieve  convergence  in  field  am- 
pliludea.  fb)  Multiwave  rirat-order  coupled-wave  theory  reaulta, 
•howing  the  effect  of  neglecting  second  derivativea  and  boundary 
effects.  The  fields  i  ■  -4-4  5  were  retained  to  achieve  convergence 
in  field  amplitudes.  (c>  Two-wave  (i  •  0,  el)  second -order  cou¬ 
pled-wave  theory  reaulta,  showing  the  effect  of  neglecting  higher -order 
waves. 


Foe  H-mode  polarization,  the  well-known  retult  of  direct 
coupling  only  between  adjacent  diffracted  ordera  is  obtained. 
However,  for  C-modt-  polarization,  it  has  been  shown  that 
direct  coupling  exiata  among  all  diffracted  orders.  The  set 
of  boundary-condition  equations  for  each  polarization  is  a  aet 
of  linear  algebraic  aquations,  and,  after  the  coupled-wave 
equations  ere  solved,  these  may  then  be  solved  for  the 
phase-matched  propagating  and  evanescent  wave  amplitudes 
outaide  the  grating. 

Numerous  calculations  have  been  performed  to  obtain  the 
fundamental  and  higher-order  forward  and  backward-dif¬ 
fracted  wave  amplitudes  for  both  H -mode-polarization  and 
E-mode -polarization  incident  wave*.  Results  for  an  example 
lossless  transmission  grating  are  shown  in  Fig.  1.  The  rigor¬ 


ously  calculated  diffraction  efficiencies  of  several  forward- 
diffracted  orders  are  shown  in  Fig.  1(a).  The  power  in  the 
E-mode  diffracted  waves  is  initially  less  than  that  for  the 
H-mode  polarization  because  of  the  reduced  coupling  in  £ 
mode  compared  with  H  mod e.  However,  as  the  thickness  is 
increased,  the  E-mode  fundamental  (+11  diffraction  efficiency 
exceeds  the  corresponding  H-mode  diffraction  efficiency. 
Diffraction-efficiency  results  from  multiwave  first-order 
coupled-wave  theory*  are  shown  for  comparison  in  Fig.  1  (b). 
In  this  half-space  theory,  second  derivatives  of  the  field  am¬ 
plitudes  and  boundary  effects  are  neglected.  Thus  a  field  that 
ia  in  fact  evanescent  (cut  off)  is  still  trested  as  s  propagating 
wave.  In  Fig.  l.thei  »  -1  field  is  evanescent.  However.this 
field  ia  predicted  by  multiwave  first-order  coupled -wave 
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Fig.  3.  Rigorously  calculated  diffraction  efficiencies  of  forward  - 
diffracted  waves  for  a  lossy  o  •  120*  slanted  grating  for  both  H  -mode 
and  C-mode  polarizations.  The  average  conductivity  is  oo  ■  400 
(ohm  -  m)"'.  The  angle  of  incidence  O'  ••  20*  h  at  the  first  Bragg 
angle.  The  modulation  »#,/<„  -  0. 1 20.  and  the  wavelength  X  *  5145 
nm.  The  average  permittivity  outaide  the  grating  is  the  same  aa  that 
inside  (e  »  2.25). 

theory  to  have  a  diffraction  efficiency  as  Urge  as  VI,  even 
though  this  wave  is  not  propagating!  Diffraction-efficiency 
results  from  two-wave  second-order  coupled-wave  theory4  are 
shown  for  comparison  in  Fig.  1(c).  In  this  theory,  the 
higher-order  waves  are  neglected,  and  so  only  the  i»0  and  i 
«  +1  fields  are  shown  in  Fig.  1(c). 

Results  for  an  example  lossless  reflection  grating  are  shown 
in  Fig.  2.  The  incident  wave  is  at  the  Bragg  angle  for  i  »  + 1 
backward-diffracted  wave.  Rigorously  calculated  diffraction 
efficiencies  are  shown  in  Fig.  2(a).  Diffraction  efficiencies 
from  multiwave  first-order  coupled-wave  theory  are  shown 
in  Fig.  2(b).  The  poor  agreement  with  rigorously  calculated 
results  is  apparent  and  is  expected  for  a  reflection  grating  (see 
Ref.  1).  Diffraction-efficiency  resulU  from  two-wave  sec¬ 
ond-order  coupled-wave  theory  are  presented  in  Fig.  2(c). 
The  good  agreement  of  the  H -mode-polarization  results  with 
rigorously  calculated  results  is  apparent  and  is  expected  for 
a  reflection  grating  (see  Ref.  1).  However,  for  E-mode  po¬ 
larization,  the  presence  of  coupling  to  all  higher-order 
(space-harmonic)  fields  (not  just  to  adjacent  orders,  as  in  the 
case  of  //-mode  polarization)  causes  this  two-wave  theory  to 
give  erroneous  results  for  this  polarization.  The  i  •  +1  fun¬ 
damental  backward-diffracted  (reflected)  wave  for  £-mode 
poUrization  is  predicted  by  this  two-wave  theory  to  be  much 


smaller  than  it  actually  is  Thia  is  a  result  of  artificially  re¬ 
stricting  tbs  coupling  to  be  between  the  i  ■(>  and  +1  space- 
harmonic  fields  rather  than  among  all  space-harmonic 
fields. 

It  may  thus  be  concluded  that,  tc  obtain  accurate  results 
for  E-mode  poUrization,  it  it  necessary  to  include  higher-order 
space-harmonic  fields  regardless  of  whether  the  fundamental 
propagating  order  is  forward  or  backward  diffracted.  In  ad¬ 
dition,  if  the  fundamental  propagating  order  is  backward 
diffracted,  the  second  derivatives  and  boundary  effects  need 
to  be  included  for  accurate  resulU  for  both  H- mode  and  £• 
mod*  polarizations. 

Rigorously  calculated  diffraction  efficiencies  for  an  example 
lossy  grating  are  shown  in  Fig.  3.  The  nonzero  conductivity 
produces  an  average  absorption  that  reduces  all  the  diffracted 
intensities,  as  would  be  anticipated. 

In  separate  calculations,  it  was  found  that  both  H-mode  and 
E-mode  poUrization  diffraction  efficiencies  reduce  to  the 
values  predicted  by  Kogetnik'z  two-wave  first-order  cou¬ 
pled- wave  theory4  in  the  limit  of  sufficiently  small  moduU- 
tion. 

Coupled-wave  analysis  is  based  on  the  Floquet  condition 
and  aa  such  applies  to  a  truly  periodic  grating  (an  infinite 
number  of  periods).  If  the  grating  fringes  are  exactly  parallel 
to  the  boundaries  (p  2  0).  the  structure  is  no  longer  periodic, 
and  coupled-wave  analysis  does  not  apply.  In  this  case, 
however,  a  simple  rigorous  chain-matrix  method  of  analysis 
may  be  used.6 

The  generalized  rigorous  coupled-wave  analysis  presented 
here  is  mathematically  exact.  There  are  no  theoretical  defi  - 
cienciet  or  approximations.  Any  arbitrary  level  of  accuracy 
is  obtainable  by  increasing  the  number  of  orders  retained  in 
the  analysis.  However,  convergence  is  very  rapid.  In  the 
numerical  calcuUtions  presented  here,  the  diffracted  ampli¬ 
tudes  were  determined  to  one  part  in  10*.  Conservation  of 
power  among  the  beams  was  accurate  to  one  part  in  ID12.  It 
should  be  recognized  that  this  level  of  accuracy  greatly  exceeds 
that  usually  presented  in  grating-diffraction  calcuUtions. 
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Diffraction  by  an  arbitrarily  or  tented  planar  (rating  with  tianted  fringes  ia  analyzed  using  riforoui  Ihree-dimen- 
aional  vector  co  ipled  wave  analysis.  The  method  appliea  to  any  linuaoidal  or  nonainusoidal  amplitude  and/or 
phaae  grating,  an '  plane-wave  angle  of  incidence,  and  any  linear  polarization.  In  the  resulting  (conical)  diffraction, 
it  ia  ahown  that  c  tupling  eziata  between  all  apace-harmonic  vector  flelda  inside  the  grating  (corresponding  to  dif¬ 
fracted  orders  outride  the  grating).  Therefore  the  TE  and  TM  components  of  an  incident  wave  are  each  coupled 
to  all  the  TE  and  TM  components  nf  all  the  forward-  and  backward-diffracted  waves.  Tor  a  general  Bragg  angle 
of  incidence,  it  ia  shown  that  the  diffraction  efficiency  can  approach  HXA  for  a  Inasleu  grating  if  either  the  incident 
electric  field  or  the  magnetic  field  ia  perpendicular  to  the  grating  vector.  Mazimum  coupling  between  incident  tnd 
diffracted  waves  is  ahown  to  occur  when  the  incident  electric  field  is  perpendicular  ('the  grating  vector.  In  gener¬ 
al.  the  diffracted  waves  are  shown  to  be  elliptically  polarized.  The  three-dimensional  vector  coupled-wave  analysis 
presented  is  shown  to  reduce  to  ordinary  rigorous  coupled  wave  theory  when  the  grating  vector  lies  in  the  plane  of 
incidence. 


INTRODUCTION 

Planar  amplitude  and  phase  gratings  are  cf  wide  interest 
owing  to  their  many  applications  in  quantum  electronics,  in¬ 
tegrated  optics,  acousto-optics,  spectroscopy,  and  holography. 
Example  grating  devices  indude  distributed-feedback  lasers, 
beam  deflectors,  beam  modulators,  waveguide  couplers, 
spectral  filler?,  wavelength  multiplexers  and  demultiplexers, 
and  holographic  beam  combiners. 

The  tnmt  common  methods  of  analyzing  planar  grating 
diffraction  are  the  coupled-wave  approach1*  and  the  modal 
approach.1'1'1*  These  investigations  were  restricted  to  the 
case  of  a  grating  vector  lying  in  the  plane  of  incidence  (the 
plane  defined  by  the  wave  normal  and  the  boundary  normal ). 
In  thia  situation,  the  TE  (electric  field  perpendicular  to  the 
plane  of  incidence)  and  the  TM  (magnetic  field  perpendicular 
to  plane  of  incidence)  components  of  the  input  plane  wave  are 
completely  decoupled  and  may  be  treated  separately.  In  this 
special  case.  (i)  if  th<-  incident  plane  wave  has  TE  polarization, 
the  grating-diffraction  problem  ia  described  as  having  H -mode 
polarization  since  the  magnetic  field  lies  in  the  plane  of  the 
wav*  normal  and  the  grating  vector  (the  electric  field  is  per¬ 
pendicular  to  the  grating  vector)  and  (2)  if  the  incident  plane 
wave  haa  TM  polarization,  the  grating-diffraction  problem 
ia  described  as  having  £ -mode  polarizat  ion  since  the  electric 
field  lies  in  the  plane  of  the  wave  normal  and  the  grating  vector 
(the  magnetic  field  is  perpendicular  to  the  grating  vector). 
However,  in  the  genera)  case,  as  treated  in  this  paper,  the 
grating  vector  may  have  any  arbitrary  orientation  with  respect 
to  the  plane  of  incidence.  In  this  situation,  the  TE  end  TM 
component*  of  the  input  plane  weve  ere  coupled  inside  the 
grating  tnd  may  not  be  treated  separately.  In  this  general 
case  the  grating-diffraction  problem  may  not  be  decomposed 
into  separate  TE-  end  TM -polarization  problems,  as  is  usually 
dona. 


The  description  of  grating  diffraction  as  a  direct  solution 
of  Maxwell's  equations  haa  been  considered  by  Nevifre  et 
at.  P  22  by  Chang  et  at.  P  and  by  Knop.24  Two  first-order 
Maxwell  equations  were  solved  directly  rather  than  by  the 
usual  procedure  of  solving  a  single  second -order  wave  equa¬ 
tion.  In  these  analyses,  the  grating  vector  was  restricted  to 
lie  in  the  plane  of  incidence.  The  general  three-dimensional 
case  in  which  the  grating  vector  is  not  in  the  plane  of  incidence 
is  sometime*  called  conical  diffraction  (tor  reasons  described 
below).  This  cue  has  been  discussed  by  Maystre25  and  has 
been  treated  using  a  Green  function  approach  by  Chuang  and 
Kong.26  The  present  work  combines  the  direct  solution  of 
Maxwell's  equation  and  the  general  three-dimensional  dif¬ 
fraction  geometry. 

THEORY 

The  general  three-dimensional  grating-diffraction  problem 
ia  depicted  in  Fig.  I.  A  linearly  polarized  electromagnetic 
wave  ia  obliquely  incident  at  an  arbitral  angle  a  on  a 
slanted-fringe  nonsinuaoidal  mixed  amplitude  and  phase 
planar  grating  of  slant  angle  *  bounded  by  two  different  ho¬ 
mogeneous  media.  The  planar  grating  has  an  arbitrary  di¬ 
rection  of  periodicity  (direction  of  grating  vector  K).  There 
are  four  fundamental  directions  that  sixcify  this  grating- 
di (fraction  geometry:  (I)  the  wave-norn  xl  direction  of  the 
incident  wave,  (2)  the  electric-field  direction  (polarisation) 
of  the  incident  wave,  (5)  the  normal  to  the  planar  grating 
boundaries,  and  (4)  the  grating  vector.  In  the  analysis  pre¬ 
sented  her*,  without  any  loss  of  generality,  the  following  ge¬ 
ometry  it  used:  ( 1)  the  boundary  normelt  ere  in  the  r  direc¬ 
tion,  (2)  the  grating  vector  is  in  the  r-r  plane,  and  (3)  the  plane 
of  incidence  makes  an  angla  I  with  respect  to  the  z  axis. 

The  modulated  region  (0  <  z  <  d)  contains  a  mixed  am¬ 
plitude  end  phase  grating.  The  grating  may  be  characterized 
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Fig- 1.  Geometry  of  a  slanted  fringe  planar  grating  with  a  plane  wave 
of  wave  vector  k ;  incident  at  an  arbitrary  angle  and  with  arbitrary 
linear  polarization. 

by  a  periodic  complex  relative  permittivity  (dielectric  con¬ 
stant)  expandable  in-a  Fourier  series  as 

r(x,r)  *  £i»  expO'/tK*  r).  (1) 

a 

The  quantity  i*  is  the  hth  Fourier  component  of  the  complex 
relative  permittivity  and  is  given  by  i»  «  rk  -  / <r»/o>< o.  where 
rA  and  o*  are  the  hth  Fourier  components  of  the  real  dielectric 
constant  and  the  conductivity,  respectively.  The  quantity 
u  is  the  angular  frequency  of  the  incident  radiation,  and  t (>  is 
the  permittivity  of  free  space.  The  grating  vector  is  given 
by 

K-K.J+  K,! 

-  K  sin  pt  r  K  cos  of,  (2) 

where  K  *  2t/A,  A  it  the  grating  period,  and  6  is  the  alant 
angle  (angle  between  the  z  axis  and  the  gratinr  vector). 

In  region  1,  the  incident  normalized  electric-field  vector 

it 

Em  -  d  exp(-/ki  •  r),  (3) 


k|  “A|(»ino  cot  it  +  tin  nr  sin  tf  +  coe  of).  (6) 

where  a  is  the  angle  of  incidence  (the  angle  between  the  wave 
normal  k|  and  the  z  axis),  1  is  the  angle  between  the  plane  of 
incidence  and  the  z  axis,  *i  "  kti,u,  ti  i*  the  relative  per¬ 
mittivity  in  region  1.  *  »  2rA,  X  ia  the  free-tpace  wavdjogth, 
and  d  it  the  polarization  unit  vector  given  by 

A  «  u,t  +  u,f  +  u,t 

•  (coe  tl  cot  a  cos  i  -  tin  ^  tin  i)J 
♦  (cue  coe  a  sin  t 
+  sin  $  cos  4)>’ 

—  cos  ^  sin  of.  (5) 

where  $  is  the  angle  betwsen  the  polarization  vector  and  the 
plane  of  incidence.  For  ^  «  O'  and  i  *  90*,  the  magnetr  field 
and  the  electric  Field,  respectively,  ate  perpendicular  to  the 
plane  of  incidence.  The  general  approach  to  solve  the  exact 
electromagnetic  boundary  value  problem  associated  with  the 
diffraction  grating  is  to  find  solutions  that  satisfy  Maxwell’s 
equations  (or  the  corresponding  wave  equations)  in  each  of 
the  three  regain*  and  then  to  match  the  tangential  electric  and 
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magnetic  Fields  at  the  two  boundaries  (z  -  0  and  r  •  d).  In 
the  general  three-dimensional  problem,  the  polarization 
cannot  be  decomposed  into  If -mode  and  E-mode  components 
with  each  of  these  treated  separately  and  then  the  results 
Combined  to  obtain  the  total  diffracted  field.  All  the  field 
components  arc  coupled  to  one  Mother,  and  solutions  for  all 
the  electric-Field  and  magnetic-field  components  have  to  be 
obtained  simultaneously.  The  normalized  total  vector  electric 
Held  in  region  l  (z  <  0)  and  in  region  3  (z  >  d)  may  be  ex¬ 
pressed  at 

Ei  -  Roc  +  E  exp(-/ku  ■  r),  (6) 

i 

Es  «  £  T,-  exp(-;kjj  •  (r  -  d)|,  (7) 

i 

where  K,  is  the  normalized  vector  electric  field  of  the  ith 
backward-diffracted  (reflected)  wave  in  region  1  with  wave 
vector  ki,  end  T,  it  the  normalized  vector  electric  field  of  the 
ith  forward-diffracted  (transmitted)  wave  in  region  3  with 
wave  vector  It*.  Note  that,  for  plane  waves,  fcu  •  K,  •  0  “  k* 
.  T,.  Phase  matching  and  the  Floquet  theorem  require 
that 

kr,  -  |(k,  - iK) - i\i  +  I(k,  - 1 K)  •  yi?  +  A ,i,t 

-M +  *»*  +  ***.  <8> 


A„  -  A|  sin  a  coe  1  -  iK  tin  6,  (9) 

Ay  «  At  tin  a  tui  4,  00) 

*t*  ■  (At*  -  A„*  -  Ay*),/*  on 

for  /  *  1, 3  (the  region  index),  Aj  -  Ariii*7*,  and  im  ia  the  av¬ 
erage  relative  pemittivity  in  region  3.  The  z  component  of 
the  wave  vector.  A,*,  is  either  positive  real  (a  propagating 
wave)  or  negative  untginary  (an  evanescent  wave).  likewise, 
for  region  I.  A,,,  is  either  negative  real  (propagating  wave)  or 
positive  imaginary  (evanescent  wave). 

The  geometrical  parameters  associated  with  the  diffracted 
waves  are  shown  in  Fig.  2.  Region  3  is  shown  in  Fig.  2,  but  the 
parameters  shown  also  apply  to  the  diffracted  waves  in  region 
I.  The  angle  of  diffraction  for  the  ith  propagating  order  ia 
given  by 

tan  flu  m  (Ah’  ♦  ky')mtk,u,  (12) 


Fitf.  2.  Gavmatry  with  t  lie  i  th  forward  -diffracted  wait. 
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Fit-  a  Ceomeliy  of  forward-diffracted  wave  vectors  showing  conical 
nature  of  diffraction.  All  forward-diffracted  waves  (i  ■  - 1  to  i  «  +21 
have  wave  vectors  that  are  equal  in  magnitude  and  have  the  tame  y 
component. 


and  the  angle  of  inclination  of  the  output  plane  is  given  by 
tan  i,  -  k,/k„.  (13) 

The  plane  of  diffraction,  in  general,  is  different  for  each  dif¬ 
fracted  order,  as  shown  in  Fig.  2.  In  the  limiting  case  when 
the  plane  of  incidence  is  the  x-x  plane.  h»  -  0  and  all  dif¬ 
fracted  orders  lie  in  the  same  plane  (the  plane  of  incidence). 
However,  if  the  plane  of  incidence  does  not  contain  the  grating 
vector,  the  A,  is  a  nonzero  constant.  The  wave  vectors  of  all 
diffracted  orders  (forward  and  backward)  have  the  same  y 
component  (perpendicular  to  the  grating  vector).  This  is 
more  clearly  shown  in  Fig.  3,  in  which  the  forward -diffracted 
waves  in  region  3  are  depicted.  T  he  magnitude  of  the  wave 
vectors  for  all  diffracted  waves  is  *i.  This,  together  with  a 
constant  value  for  ky  for  all  diffracted  orders,  means  that  the 
wave  vectors  lie  on  the  surface  of  a  cone  (with  the  cone  axis  in 
the  y  direction);  hence  the  terminology  "conical  diffraction" 
for  this  general  three-dimensional  geometry.  (In  general,  the 
cone  axis  is  in  the  it  x  K  direction,  whete  rf  is  the  normal  to 
the  boundary.) 

The  magnetic-field  vector  in  regions  1  and  3  can  be  obtained 
by  using  the  Maxwell  equation 

H  «  (/Aupolv  X  E,  (14) 

where  m>  is  the  permeability  of  free  space,  which  is  the  as¬ 
sumed  permeability  in  all  regions. 

In  the  modulated  region  <0  <  z  <  d),  the  electric  and  mag¬ 
netic  fields  may  be  expressed  as  Fourier  expansions  in  terms 
of  the  space-harmonic  fields  at 


“  E  IS„M*  ♦  +  S„(r)f|exp(->e,  •  r),  (15) 

I 


Hj  -  Wpo)'*  £  lU„(r)l  +  (7„(z)* 

+  l/„U)i]exp(-;>,  •  e). 

(16) 

where 

•,  -  k„t  +  kj  -  iK,t. 

(17) 

The  x  and  y  components  of  e,  are  determined  by  the  Floquet 


theorem  and  the  phase-matching  condition;  the  r  component 
k  arbitrary  and  can  be  included  in  the  S,  (r )  and  U,  (x )  func¬ 
tions.  The  z  component,  however,  it  chosen  in  Eq.  (17)  so  that 
the  differential  coupled-wave  equations  to  be  derived  later 
will  have  constant  coefficients  and  will  thus  be  directly  solv¬ 
able  by  the  state-variable  method.  S,(r)  and  U,  (r  I  are  the 
normalized  amplitude*  of  the  ith  space-harmonic  vector 
electric  end  vector  magnetic  fields  such  that  and  H?  satisfy 
Maxwell's  equations  (or  the  appropriate  wave  equations  de¬ 
rived  from  Maxwell's  equations)  in  the  grating  region. 

For  the  case  when  d  «  0,  the  grating  vector  is  normal  to  the 
boundary.  Such  gratings  are  called  pure  reflection  gratings. 
The  grating  fringe*  (surfaces  of  constant  c)  in  this  case  are 
parallel  to  the  grating  boundaries  (z  »  0,  d).  Since  the  per¬ 
mittivity  is  no  longer  periodic  along  the  boundary,  the  field 
inside  the  grating  can  no  longer  be  expanded  in  terms  of 
space-harmonic  components.  However,  this  pure-reflec- 
tkm-grating  case  can  be  simply  analyzed  without  approxi¬ 
mation  by  using  a  rigorous  chain-matrix  method  of  anal¬ 
ysis.*7 

METHOD  OF  SOLUTION 

In  the  general  three-dimensional  vectorial  problem  under 
consideration,  all  the  electric  and  magnetic  space-harmonic 
fields  are  coupled  to  one  another,  Therefore.  rather  than 
attempting  to  construct  and  solve  two  complicated  vector 
wave  equations,  it  is  more  convenient  and  straightforward  to 
solve  Maxwell's  equations 

FXEj*  -ju-MtBj.  (18) 

»XH)«  jut  or(x.  x)Ej  (19) 

directly.  Substituting  Eqs.  (15)  and  (16)  into  these  two 
equations,  and  eliminating  the  components  of  Ej  and  H; 
normal  to  the  boundary,  results  in  a  set  of  four  first -order 
coupled-wave  equation*: 

-  -/ [ifC,S„(r)  +  (*„/*)  Z  o,.„[ktU„U) 

-*.rt/,p(r)l  +  «/,.(r)j.  (20) 

"  -/  {iAf,S„(i)  -  kU.,U) 

+  (*,/*)  Z  «,-,|*,t/„(r>  -  *„Uw(r))J. 

(21) 

~ ^  -  /  {(*,./»)M.,(e)  -  *„S„(e)l 

♦*£f.-A,(*)-i/C,t/.,<  )}.  (22) 

«!</„<<)  I,  „ 

— "  -l  j*Ef.-eS.e(*l-(V*H*A,<*> 

-*»S„(z)|4ilf,l/,I(z)j.  (£3) 

where  p  *  i  -  h  and  »»  is  the  Mh  coefficient  of  the  Fourier 
expansion  of  c^U,!)  in  the  form 

r-'(x,r)  m  Zat,  expO'hK- r).  (24) 
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Not*  that,  whan  tha  grating  vector  ia  in  tha  plane  of  inei- 
danoa.  k,  ■  0  and  the  eoupled-wava  equations  (Eqs.  (20M23H 
are  reduced  to  two  eete  of  coupled-wave  aquations;  the  tint 
eet  [Eqs.  (20)  and  (23)1  gives  the  solution  for  the  £-mode- 
polarization  case,  and  the  second  set  (Eqs.  (21)  and  (22))  gives 
the  solution  fir  ff-mode  polarization. 

The  coupled-wave  equations  jEqa.  (20)— (23)]  may  be 
written  in  a  math*  form  as 


or  in  compact  form  as 

V  -  AV,  (26) 

where  Vis  a  vector  composed  of  Sn.Syj.Uu,  and  The 
coefficient  matrix  A  is  the  system  matrix  composed  ot  the  16 
submatrices  in  Eq.  (25)  that  are  in  turn  specified  by  the  4  sets 
of  coupled-wave  equations. 

Equation  (26)  may  be  solved  using  a  state-variable  method 
(described  in  detail  in  previous  publications*-2*)  by  calculating 
the  eigenvalues  and  eigenvectors  associated  with  the  matrix 
A.  The  solutiona  of  the  coupled-wave  equations  using  the 


state-variable  method  may  be  expressed  as 

Sy.fzl-LC.nicomexplXnr).  (27) 

m 

S,.(z)  ■  E  exp(A„r).  (28) 

m 

U.M  “  £  C„io3j„  exp(A„r),  (29) 

m 

U>M  "  E  C„wUm  exp(A„z),  (30) 

m 


where  C„’s  are  the  unknown  constants  to  be  determined  from 
boundary  conditions.  A„'s  are  the  eigenvalues  of  the  matrix 
A.  and  u>,  jw's  are  the  elements  of  the  eigenvector  matrices 
corresponding  to  a  given  van  le  of  i  (space-harmonic  field  in¬ 
side  the  grating  or  diffracted  order  outside  the  grating).  The 
eigenvalues  and  eigenvectors  are  typically  calculated  by  using 
a  computer  library  program.29  Note  that,  if  n  space  har¬ 
monics  (values  of  i)  are  retained  in  the  analysis,  the  matrix  A 
will  be  4n  X  li  and  the  vector  V  will  be  of  length  4n.  This 
system  of  equations  produces  4n  eigenvalues  and  4n  values 
of  the  unknown  constants  C„.  and  esch  of  the  four  eigenvector 
submatrices  (q  ■  1, 4)  U’„j„  will  be  an  n  X  4n  matrix  (n  values 
of  i  and  4n  values  of  m). 

The  amplitudes  of  the  diffracted  fields  R,  andT,  (together 
with  Cn )  are  calculated  by  matching  the  tangential  electric 
and  magnetic  fields  at  the  two  boundaries.  At  z  ■  0, 

«.4i0  +£..  «S..(0),  <311 

Uyi.o  +  E.,  «SV1(0),  (32) 

i.o(*.u,  -  *i  cos  au,)  -  k.uRyi  +  k,R„  -  *(/«■( 0). 

(33) 

6,o(*i  cos  au,  -  k.ou,)  +  k,,,R„  -  k„R„  -  AC/„(0). 

(34) 

At  z  «  d, 

(35) 
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Tyi  *  S„  (d  )exp0iK,d),  (36) 

~K*Tyi  4-  kyTu  m  kU,i(d)tip(JiK,d),  (37) 
k,*T.i  -  hiTji  w  kU,,  (d)ezp(/i Ajd).  (38) 

Note  that,  since  k,c  R.-0  and  kji-T. -0,  then 

*4  AyEj,  +  A,  i,Rri  ■  0,  (39) 

*.<T„  4-  kyTyi  +  k,\T„  -  0.  (40) 


The  system  of  simultaneous  linear  equations  given  by  Eqs. 
(31 M40)  may  be  solved  for  R,  and  T,  (by  using  a  technique 
surh  as  Gauss  elimination  with  the  maximum  pivot  strate¬ 
gy”).  Note  that  the  number  of  equations  is  exactly  equal  to 
the  number  of  unknowns.  For  examplr,  if  n  waves  (values  of 
i)  art  retained  in  the  analysis,  there  will  be  4n  values  of  C„ 
and  3n  components  of  each  of  R,  and  T,.  Thus  the  total 
number  of  unknowns  is  Ifln,  which  is  exactly  the  number  of 
equations  given  by  Eqs.  (31 H40).  The  computational  time 
and  storage  requirements  may  be  reduced  appreciably  by 
eliminating  R,  and  T,  from  Eqs  (31 M38)  and  solving  for  the 
Cm's  and  then  calculating  R,  and  T,. 

The  diffraction  efficiency  is  defined  as  the  ratio  of  the 
component  of  the  real  power  carried  by  the  diffracted  wave 
normal  to  the  boundary  (z  component)  to  the  corresponding 
component  of  the  real  power  associated  with  the  incident 
wave.  That  it, 

PE„  -  -Ref*.,,/*,  co.XR.1  *.  (41) 

DEj,  -  Re(A,a/*i  coaXT.I*  (42) 

where  DEt,  and  DEj,  are  the  diflVaction  efficiencies  of  the 
backward-diffracted  (region  1)  and  forward-diffracted  (region 
3)  waves  in  the  directions  ki,  and  k*.  respectively. 

Power  conservation  requires  that  for  lossless  phase  gratings 
the  sum  of  the  efficiencies  for  ail  the  propagating  waves  be 
unity.  That  is 

E  (DEi,  +  DEs)  -  1-  (43) 

It  is  important  to  note  that  Eq.  (43)  is  satisfied  for  lawless 
phase  gratings  independently  of  the  number  of  waves  included 
in  the  analysis.  Thus  it  sums  to  unity  for  any  number  of  space 
harmonics  retained,  independently  of  whether  the  corre¬ 
sponding  fields  outside  the  grating  are  propagating  or  evan¬ 
escent.  Any  significant  deviation  in  the  sum  would  indicate 
the  pretence  of  round-off  errors  in  the  numerical  calculations. 
However,  for  all  the  calculations  performed,  the  deviation  was 
of  the  order  of  10” 12  (when  a  CDC  760/730  computer  was 
used).  However,  the  accuracy  of  each  individual  order  de¬ 
pends  on  the  number  of  space  harmonics  retained  in  con¬ 
structing  the  A  matrix.  In  all  the  resulta  presented,  these 
errors  are  less  than  10'*.  It  should  be  noted  that  this  level 
of  accuracy  greatly  exceeds  that  usually  presented  in  grat¬ 
ing-diffraction  calculations. 

In  summary,  the  algorithm  used  to  solve  this  problem 
proceeds  as  follows:  First,  the  coefficient  matrix  A  is  con¬ 
structed.  Second,  the  eigenvalues  and  the  eigenvectors  are 
calculated.  Th:rd,  the  system  of  linear  equations  |Eqs. 
(31  )-(40>)  (or  a  modified  version  of  these)  is  constructed  and 
solved  for  the  R,' s  and  T,'».  Fourth,  the  diffraction  ef¬ 
ficiencies  are  calculated  using  Eqs.  (41)  and  (42). 

It  is  important  to  note  that,  if  it  is  desired  to  calculate  the 
diffraction  efficiencies  for  mother  polarization  (another  value 
of  the  new  diffracted  fields  R,(^|  and  T,<^)  i r—  *- 


T„  »  S„(dlexpQi£i<f), 
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collated  for  tha  now  polarization  ^  by  using  the  values  of  R, 
and  T,  for  any  two  noncollinear  pola/izaUona  («.(.,  f  t  and  ^i) 
and  tbo  following  relationships: 

R.(*>-|sm(#»-tf)IM*i> 

-  iio(^i  -  (44) 

T,<|M  ■  I»in(<f*  -  <t’Y?,('h) 

-  smith  -  ^)T,Wi)J/»in(^j  -  #,)•  (45) 

However,  the  values  of  R,(h).R,(th).T,(h).  and  T,!^j)  mu»t 
•till  be  determined  by  using  the  complete  three-dimeniional 
vector  theory  a*  described  in  this  paper.  This  does  not  imply 
that  the  problem  can  be  decomposed  into  uncoupled  TE  and 
TM  problems. 

The  above  method  of  solution  applies  equally  for  incidence 
at  a  Braca  angle  or  for  a  general  angle  of  incidence.  Bragg 
incidence  occurs  when  the  quantity  wi  given  by 

m  a  WK’Hk.oK,  4  Re(*¥„  -  *,0J  -  V>,/1*»l  (46) 
is  an  integer.  This  would  then  correspond  to  the  mth  Bragg 
condition. 


RESULTS  AND  DISCUSSION 

The  rigorous  three-dimensional  vector  coupled-wave  analysis 
presented  in  this  paper  describe*  the  diffraction  by  an  arbi¬ 
trarily  oriented  planar  grating  wit)  nted  fringes.  In  gen¬ 
eral,  the  grating  vector  does  not  lie  the  plane  of  incidence, 
and  conical  diffraction  results.  For  the  special  case  when  4 
"  0,  the  grating  vector  does  lie  in  the  plane  of  incidence,  and 
It  is  possible  to  compere  the  results  from  the  present  vector 
theory  with  previously  published  rigorous  scalar  theory  re- 
aulta.**  Example  results  for  4  ■  0*  are  shown  in  Fig.  4  for  an 
unslanted  phase  grating  with  e  grating  period  equal  to  the 
wavelength  of  tight  in  the  medium.  The  normalized  electric 
field  and  the  diffraction  efficiency  of  the  first -order  (i  -  41) 
forward -diffracted  wave  are  shown.  When  ^  «  90*  or  ^  -  O', 
the  incident  wave  has  TE  or  TM  polarization,  respectively. 
For  tl'  “  90*.  the  incident  TE  wave  is  coupled  only  to  TE  waves 
(such  as  the  i  ■  41  diffracted  TE  wave  shown  in  Fig.  4(a)|. 
However,  there  ts  no  coupling  of  the  incident  TE  wave  to  TM 
waves  (see  Fig.  4(b)).  Likewise,  if  the  incident  wave  has  TM 
polarization  (^  “  O'),  it  is  coupled  only  to  TM  waves.  There 


EE) 


<c>  n 

Fig.  4.  Characteristics  of  first  order  fi  "  411  forward  diffracted wtve for tnunslanfed  fringegratingld  •  90*1  withe  plane  wave  of  wavelength 
A  “  0.500  vm  incident  at  the  first  Hregg  angle  to  ■  :tlr*  I  with  the  grating  vector  lying  on  the  plane  of  incidence  <4  *  0*  I  The  grating  is  lossless 
and  has  an  average  relative  permittivity  of  m  ■  2.25  and  a  relative  permittivity  modulation  of  n  “0  01.  The  relative  permittivity  outside  the 
gralingregionielhesameastheaveragcrelativepermiuivityofthegrating.  lalTH  component  of  the  normahred  diffracted  elcctnc-fieM  amplitude 
for  various  incident  linear  polarization*.  *.bl  1  M  component  of  the  normallred  diffracted  electric  field,  tc)  Diffraction  efficiency  of  diffracted 
wave,  tdl  Angular  selectivity  for  a  grating  with  a  thickness  of  d  •  50  pm. 
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is  no  coupling  of  TM  wave*  to  TE  waves  in  this  4  ”  0  caos.  For 
the  intermediate  linear  polarization*  of  the  incident  plan* 
wave,  the  electric  field  can,  therefore,  be  decomposed  into  TE 
and  TM  component*,  and  the  diffraction  of  the**  component* 
may  be  treated  entirely  separately  | ***  Eqa.  (20>— ( 2-3)  with  *, 
—  0).  The  output  diffracted  field  may  then  be  obtained  by 
vector  addition  of  the  individual  diffracted  TE  and  TM 
component*.  The  resultant  diffraction  efficiencies  are  ahown 
in  Fig.  AM.  Since  the  conversion  of  incident  TE  to  diffracted 
TE  waves  and  the  conversion  of  incident  TM  to  diffracted  TM 
waves  have  different  coupling  strengths,  the  resulting  TE  and 
TM  diffracted  wave  amplitude*  change  at  different  rate*  with 
respect  to  grating  thickness  [comoare  Figs.  4(a)  and  4(b)]. 
This  out-of-phase  behavior  (with  respect  to  grating  thickneaa) 
causes  the  total  normalized  diffracted  field  to  be  leu  than 
unity,  end  thus  the  diffraction  efficiency  doe:  not  reach  100% 
with  increasing  thickness  (see  Fig.  4(c)|.  However,  for  an 
incident  polarization  that  is  purely  TE  (and  thus  E  is  per¬ 
pendicular  to  K  for  4  “  0*1  or  purely  TM  (H  perpendicular 
to  K  for  4  »  0*).  the  resultant  diffraction  efficiency  will  ap¬ 
proach  100%  with  increasing  thickness  |see  Fig.  4(c)}.  For  a 
grating  of  50-pm  thickness,  the  diffraction  efficiency  nor¬ 


t*) 


malized  to  the  value  et  the  Bragg  angle  (30*)  is  shown  as  a 
function  of  angle  of  incidence  in  Fig.  4(d).  This  genera)  form 
of  angular  selectivity  it  well  known  for  thick  gratings.4  The 
angular  width  of  the  central  angular-selectivity  lobe  is  wider 
for  i'-zDode  polarization  fib  «  0")  than  for  H -mode  polariza¬ 
tion  because  the  smaller  coupling  strength  of  that  polarization 
produce,  less  dephasing  for  a  given  angular  deviation  from  the 
Bragg  angle.  All  the  numerical  vector  coupled-wave  analysis 
calculations  in  Fig.  4  have  been  repeated  using  scalar  rigorous 
coupled-wave  analysis  based  on  solving  scalar-wave  equa¬ 
tions*9  for  the  individual  TE  and  TM  components.  This 
method  of  analysis  duplicate*  the  results  ahown  in  Fig.  4. 

Example  reaulta  for  a  general-transmission  grating  when 
the  grating  vector  does  not  tie  in  the  plane  of  incidence  are 
shown  in  Fig.  5.  The  grating  and  the  wavelength  are  the  same 
as  those  in  Fig.  4.  However,  the  plane  of  incidence  is  now 
inclined  to  4  «  30*.  For  a  general  plane-of-incidence  incli¬ 
nation  angle  4,  the  Bragg  condition  (Eq.  (46)]  may  be  rewritten 
for  a  lossless  grating  as 

m  »  |2<ri ),/2A/X| (sin  n  sin  $  cos  4 

+  Uo/ci  -  sinan),/2  coa  «].  (47) 


*» 
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Fig.  5.  Characteristics  of  first-order  (i  •  +1)  forward  diffracted  wive  for  the  same  under, trdfnnve  grating  (e  •  90*)  as  in  Fig.  4  with  a  plane 
wave  of  wa-  alength  of  X  •  0  500  am  incident  at  the  fim  Bragg  angle  (n  ■  35  2S*1  with  the  plane  of  incidence  inclined  at  an  angle  of  4  -  30*. 
The  grating  vector  is  therefore  nnt  in  the  plane  of  incidence,  la)  TK  component ,.?  the  nortnalued  diffracted  electric  field  for  various  incident 
linear  polarizations,  (hi  TM  componznt  of  the  normalized  diffracted  electric  held,  (cl  Diffraction  efficiency  of  diffracted  wave,  (dt  Angular 
selectivity  fora  grating  with  a  thicln-taof  d  ■  50pm. 
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Fig.  6.  Nunnalired  diffracted  electric-field  amplitude  fop  the  Mm e  grating  n  in  Fiji.  4  and  5  for  a  patina  vector  both  in  plane  of  incidence 
(4  •  0*)  and  out  of  plane  of  incidence  la  ■  30*1  fo.  (a)  incident  electric  field  perpendicular  to  the  crating  vector  and  (b)  incident  magnetic  field 
perpendicular  to  crating  vector.  The  aolid  and  daahed  line*  are  the  normalized  electric  fieida.  The  dotted  line  a  the  phase. 


Solving  this  equation  for  the  first  Bragg  angle  (m  <•  1 )  gives 
a  *  35.26*.  For  an  incident  TE  wave  ■  90*1,  coupling  is 
now  observrd  to  both  the  TE  diffracted  wave  and  the  TM 
diffracted  wave  |Figs.  5(a)  and  5(b)],  Likewise,  an  incident 
TM  wave  (t)  w  0*)  is  coupled  to  both  the  TM  diffracted  wave 
ana  the  TE  diffracted  wave. 

Whether  or  not  the  grating  vector  lies  in  the  plane  of  inci¬ 
dence,  if  is  shou  n  that  the  diffraction  efficiency  can  approach 
I00rc  in  o  lossless  grating  if  the  incident  electric  field  (or 
magnetic  field )  is  perpendicular  to  the  grating  lector  (see 
Figs.  4(c)  and  5(c)|.  The  condition  for  E  to  be  perpendicular 
to  K  is  that  E  •  K  «  0.  and  this  may  be  written  as 

tan  if  -  con  a  cot  4  -  sin  o  esc  4  cot  p.  (48) 

For  the  case  represented  by  Fig.  5.  thU  gives  if  -  54.74*.  For 
H  «o  be  perpendiculn  to  K.  the  value  of  f  ia  changed  by  90* 
from  the  value  given  hy  Eq.  (481.  For  the  cases  of  incident 
E  jerpcndiculor  to  K  (or  incident  il  perpendicular  to  K), 
if  is  shown  thet  :he  diffraction-efficiency  minima  in  the 
angular-selectivity  curves  approach  zero.  In  Fig.  4(d),  the 
angular  selectivity  minima  are  nulls  because  the  incident  TE 
and  1 M  (if  "  90*  and  if  *  0")  waves  have  electric  and  mag¬ 
netic  fields,  respectively,  perpendicular  to  the  giating  vector. 
In  Fig.  5(d),  the  incident  TE  and  TM  waves  no  lunger  have 
electric  and  magnetic  fields  perpendicular  to  K,  and  so  non- 
rero  minima  occur  in  the  angular-selectivity  curves.  This  is 
due  to  different  dephasing  rates  with  changing  angle  of  inci¬ 
dence  for  the  resultant  TE  and  TM  components  of  the  dif¬ 
fracted  field. 

Figure  6(a)  shows  the  diffracted  electric  field  for  the  case 
of  E  perpendicular  to  K  for  the  grating  of  Figs.  4  and  5  (4  ■  0* 
and  4  •  JO*).  Likewise,  Fig.  <5(bl  shows  the  diffracted  electric 
field  for  the  rase  of  H  perpendicular  to  K.  The  grating  for  the 
four  cases  d'uieled  in  Fig.  6  is  the  tame  grating  aa  in  Figs.  4 
and  5.  only  the  incident  direction  and  the  incident  polarization 
are  rhanged-  \5  hen  ihe  grating  vector  is  in  the  plane  of  -n- 
cidenre  4  «  0*.  the  diffracted  fields  for  incident  E  perpen¬ 
dicular  to  K  and  incident  H  perpendicular  to  K  are  completely 
decoupled,  as  was  staled  before.  For  the  case  of  the  grating 


vector  not  in  the  plane  of  incidence  (4  »  30*).  incident  E 
perpendicular  to  K  results  in  TE  and  TM  diffracted  field 
components  that  oscillate  with  increasing  thickness  with  Ihe 
same  period,  as  shown  in  Fig.  6fa>.  Because  of  this  syn¬ 
chronism  with  grating  thickness,  the  diffraction  efficiency 
approaches  100%  with  increasing  thickness  |Fig.  5(c)).  This 
is  in  contrast  to  the  TE  and  TM  diffracted  components  of  the 
other  incident  pol&iizations  (tf  -  0*.  30*. 90*)  depicted  in  Figi. 
5(a)  and  6(b)  for  4  «  30*.  Likewise,  incident  H  perpendicular 
to  K  also  results  in  TE  and  TM  diffracted  field  components 
that  are  in  synchronism  with  grating  thickness  as  shown  in  Fig. 
6(b),  and  this  leads  to  essentially  lltt%  diffraction  efficiency, 
at  shown  in  Fig.  5(c>.  Since  there  is  less  incident-wave  to 
diffracted-wave  coupling  for  incident  H  perpendicular  to  K, 
a  larger  grating  thickness  is  required  to  achieve  the  same 
diffraction  efficiency.  The  phase  difference  between  theTE 
and  TM  components  (Fig.  6.  dotted  lines)  represents  the  de- 


Fut.7.  Diffraction  efficiency  of  first-order  (i  *  411  backward  dif¬ 
fracted  wsve  tors  slanted-fringe  (•  •  10*1  grating  with  a  plane  wave 
of  wavelength  of  A  v  O.fOOpm  incident  at  the  first  Bragg  angle  fit  “ 
2n  87*  |  with  the  plsne  of  incidence  inclined  at  an  an-!e  of  i  *  45*. 
The  grating  vector  is  therefore  not  in  the  plane  of  incidence.  The 
grating  h  kaalrea  and  haa  an  average  relaine  permittivity  of  at  "  2-25 
and  a  relative  permittivity  modulation  of  001. 
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im  of  »UipUWty  of  the  output  poiatiution.  For  the  cue 
shown,  the  elliptic  jiy  n  relatively  small  because  ot  the  iveraie 
relative  permiltivity'i  being  the  same  in  all  three  regions.  If 
these  dielectric  constants  differed  significantly  in  the  three 
regions,  the  elliptuity  would  be  correspondingly  greater. 

Figure  7  shows  the  diffraction  efficiency  of  the  first -order 
(i  “  el)  backward-diffracted  wave  that  is  due  to  a  reflection 
grating  <0  •  10*).  The  grating  vector  does  not  lie  in  the  plane 
of  incidence  (A  »  45*).  In  this  case  the  diffraction  efficiencies 
for  all  linear  polarization  angles  $  approach  100%  with  in¬ 
creasing  thickness.  However,  as  before,  the  coupling  between 
incident  and  diffracted  leaves  it  g reatett  for  tne  incident 
electric  field  perpendicular  to  the  grating  vector.  Thus  the 
diffraction  efficiency  increases  most  rapidly  with  increasing 
thickness  for  that  cose. 

The  vector  diffraction  theory  described  in  this  paper  can 
alw  be  applied  to  surface-relief  (corrugated)  gratings  by  using 
the  method  of  decomposition  into  planar  grs'ings  described 
in  Ref.  31. 

SUMMARY 

A  rigorous  three-dimensional  vector  coupled-wave  analysis 
of  diffraction  by  a  slanted-fringe  grating  has  been  presented 
for  an  arbitrary  angle  of  incidence  in  three  dimensio.is.  The 
method  applies  to  any  sinusoidal  or  nonsinusoidal  complex 
permittivity  (amplitude  and/or  phase)  grating  and  any  linear 
polarization.  Since  no  physical  approximations  are  made,  any 
arbitrary  level  of  numerical  accuracy  may  be  achieved.  It  has 
been  shown  that  the  TE  and  TM  components  of  an  incident 
plane  wave  arc  each  coupled  to  all  the  TE  and  TM  compo¬ 
nents  of  all  the  forward-diffracted  waves  and  backward-dif¬ 
fracted  waves.  For  a  general  Bragg  angle  of  incidence  in  three 
dimensions,  it  u  shown  that  the  diffraction  efficiency  can 
approach  100%  for  a  lossless  grating  if  either  the  incident 
electric  field  or  the  incident  magnetic  field  is  perpendicular 
to  the  grating  vector.  Maximum  coupling  between  incident 
and  diffracted  waves  •*  shown  to  occur  when  the  incident 
electric  field  is  perpendicular  to  the  grating  vector.  In  general, 
the  diffracted  waves  are  elliptically  polarized.  Even  though 
it  is  possible  to  Tabulate  the  fields  for  any  polarization  <^) 
given  the  fields  for  two  orthogonal  polarizations,  these  or¬ 
thogonally  polarized  fields  must  be  first  calculate  from  a 
general  three-dimensional  vector  theory,  such  as  that  pre¬ 
sented  here,  because  of  the  inherent  coupling  between  these 
fields. 
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Abstract.  Planar  (cosinusoidal  conductivity  (absorption)  transmission  gratings  are  ana¬ 
lyzed  using  rigorous  coupled-wave  theory.  The  fit -t-order  and  higher-order  diffraction 
efficiencies  are  de'ermined  over  the  entire  range  of  possible  conductivities  and  Bragg  angles 
of  incidence  (or  equivalently,  grating  periods)  for  H-mode  polarization  incident  plane 
waves.  The  maximum  possible  first  diffracted  order  efficiency  is  found  to  be  5.26  V 
Rigorous  results  are  compared  to  approximate  results  from  the  Raman-Nath  th  or y  and 
the  two-wave  first-order  coupled-wave  (Xogelnik)  theory.  A  regime  parameter,  p,.  is  defined 
which  delineates  the  regions  of  Raman-Nath  diffraction  behavior  (g,  <  I )  and  the  region  of 
two-wave  first-order  diffraction  theory  behavior  (<>,>!).  Likewise,  the  angular  selectivity 
characteristics  of  conductivity  gratings  are  determined  from  rigorous  theory  and  are 
compared  with  corresponding  results  from  approximate  theory. 

PACS:  L’M.  42.40 


Optical  diifraction  by  planar  transmission  gratings  is  a 
subject  uf  fundamental  importance  in  optics.  Fields  or 
application  include  acousto-optics,  integrated  optics, 
quantum  electronics,  holography,  and  spectroscopy, 
Grating  device  functions  include  laser-beam  deflection, 
modulation,  coupling  filtering,  distributed  feedback, 
distributed  Bragg  reflection,  holographic  beam  com¬ 
bining.  wavelength  multiplexing,  and  wavelength 
demultiplexing 

A  rigorous  coupled-wave  theory  (without  approxi¬ 
mations)  has  recently  been  formulated  for  lossless 
dielectric  gratings  with  relative  permittivity  (dielectric 
constant)  modulation  [I],  This  analysis  has  been 
shown  to  be  general  and  the  approximations  used  in 
previous  theories  have  been  explicitly  quantified  [2].  It 
is  the  purpose  of  this  paper:  I)  to  extend  the  rigorous 
coupled-wave  analysis  to  (cosinusoidal  conductivity 
(absorption)  gratings.  2)  to  show  that  the  maximum 
diffraction  efficiency  is  5.26%  (rather  than  3.70%  from 
Kogelnik  theory  [3)  or  4.80%  from  Raman-Nath 
theory  [4]  for  these  gratings),  3)  to  define  the  diffrac¬ 
tion  regimes  and  their  boundaries  for  transmission 
absorption  gratings,  and  4)  to  determine  rigorously  the 


angular  selectivity  characteristics  of  these  gratings  and 
compare  them  to  those  from  approximate  theory.  To 
assist  in  isolating  the  basic  diffraction  characteristics 
from  other  physical  effects,  the  fundamental  case  of  the 
same  permittivity  inside  and  outside  the  grating,  an 
unslanted  grating  (fringes  perpendicular  to  surface), 
and  H-mode  polarization  (electric  field  perpendicular 
to  the  plane  of  incidence  and  perpendicular  to  the 
grating  vector)  is  treated. 

I.  Theory 

/./.  Conductivity  Grating 

The  gratings  analyzed  in  this  work  have  a  conductivity 
of  the  form 

efx)wev+e,cosJfx.  (I) 

The  grating  is  unslanted  with  grating  vector  K  (of 
magnitude  K  ■  2 n/A.  A  being  the  grating  period)  along 
the  x-axis.  The  planar  surfaces  of  the  grating  medium 
are  at  t »  0  end  t-d.  The  p'ane  of  incidence  is  the  x  -  z 
plane  and  thus  all  quantities  ate  invariant  in  the 


y-direc*ion.  The  permittivity  «  of  the  grating  if  con* 
■mm  and  equal  to  the  permittivity  of  the  surrounding 
medium.  The  permeability  p  is  that  of  free  space.  In 
terms  of  these  parameters  the  attenuation  factor  a(x)  is 

+(*/<«€  (2) 

where  at  is  the  angular  frequency  of  the  incident  light 
wave.  The  primary  quantities  of  interest  here  are  the 
diffraction  efficiencies  of  the  first-order  and  higher- 
order  transmitted  diffracted  waves.  In  particular,  the 
maximum  value  of  the  first-order  diffraction  efficiency 
is  obtained  for  the  total  range  of  conductivities  and 
grating  periods  at  Bragg  incidence. 

IJ.  Rigorous  CoupleJ-Wave  Theory 

The  rigorous  coupled-wave  equations  for  an  unslanted 
(cosinusoidal  conductivity  grating  for  H-mode  polar¬ 
ization  are 

-rf^-Tfe-  0  )  “ST 

+ j  i(<"  -  OS/c)  - j  j  o ,  iJofS, , ,  (c) + S, .  ,(i)]  =  0 . 

(3) 

where  S,(r)  is  the  normalized  amplitude  of  the  f  *  space- 
harmonic  field  at  any  point  within  the  modulated 
region,  i.  is  the  free  space  wavelength  of  the  incident 
plane  wave.c,  is  the  permittivity  of  free  space.  c0  is  Che 
relative  permittivity  (dielectric  constant)  inside  and 
outside  of  the  grating.  6'  is  the  angle  of  incidence  in  the 
input  region. 

m«2dei'1sinff/il  (4) 

is  the  Bragg  condition  for  an  unslanted  absorption 
grating  (m  m  1  for  incidence  at  the  first  Bragg  angle,  0,). 
<fo**(f*o/eo)l,>  '*  characteristic  impedance  of  free 
space,  and  p0  is  the  permeability  of  free  space.  These 
rigorous  coupled-wave  equations  can  be  solved  by 
state-variable  methods  [5].  Then  with  the  application 
of  electromagnetic  boundary  conditions  (continuity  of 
tangential  E  and  tangential  H  at  z-0  and  r-<f),  the 
diffracted  fields  and  thus  the  diffraction  efficiencies 
can  be  calculated  for  any  order,  reflected  or 
transmitted  [(]. 

/J.  TV/o-Wave  First-Order  Thtory 

In  this  approximation  to  the  rigorous  theory 
(Kogelnilc  theory  [3]),  the  only  orders  retained  in  the 
analysis  are  f«0  and  +1;  the  second  derivatives  of 
field  amplitudes  are  assumed  negligible;  and  the 
boundary  conditions  on  the  two  space-harmonic  field 
amplitudes  are  assumed  to  be  S0(0)»!  and  5,(0) “0. 


The  diffraction  efficiency  for  the  first-order  transmitted 
wave  according  to  this  theory  is  given  by 

<5> 

and  the  zero-order  (undiffracted)  transmitted  efficiency 
is  predicted  to  be 


(6) 


The  maximum  first-order  diffraction  efficiency  occurs 
when  a,  »es  and  i?o0(d/24'Icos0'«ln3.  This  maxi¬ 
mum  efficiency  has  a  value  of  DE  ,  1/27  s  3.70  V 

The  results  of  this  well-known  two-wave,  first-order 
approximation  are  used  as  a  comparison  for  the  results 
obtained  from  rigorous  theory. 

1.4.  Multiwave  First-Order  Theory  Without  Dephasing 

la  this  approximation  to  the  rigorous  theory  (an 
extension  of  the  Raman-Nath  theory  of  phase  gratings 
[6-8]  to  absorption  gratings  [4]k  the  second  de¬ 
rivatives  of  the  space-harmonic  field  amplitudes  are 
assumed  negligible,  dephasing  from  the  Bragg  con¬ 
dition  is  ignored,  and  the  boundary  conditions  on  the 
space-harmonic  field  amplitudes  are  assumed  to  be 
S0(0)-1  and  S,(0)-0  for  i+0.  The  diffraction  ef¬ 
ficiency  predicted  for  any  transmitted  diffracted  order  i 
is  given  by 

<* 

where  I*  is  a  modified  Bessel  function  of  the  first  kind 
of  integer  order  L  The  quantity  i  is  equal  to  the 
diffracted  order.  The  maximum  first-order  diffraction 
efficiency  occurs  when  e,  »e0  and  »f0«0if/2£j  J  cosff 
"1.545  and  has  a  value  of  DEliBnw4.80V  The 
results  of  this  multiwave,  first-order  theory  without 
dephasing  are  used  as  a  comparison  for  the  results 
obtained  from  rigorous  theory. 


2.  DifTracttoa  Characteristics 

To  determine  the  diffraction  characteristics  of  planar 
(co)sinusoidal  conductivity  gratings,  the  first-order  and 
higher-order  diffraction  efficiencies  were  calculated 
using  the  rigorous  coupled-wave  theory.  The  maxi¬ 
mum  first-order  transmitted  diffraction  efficiency  was 
determined  for  each  value  of  conductivity  modulation 
and  Bragg  angle  of  incidence  (or  equivalently,  the 
grating  period).  The  rigorously-determined  diffraction 
efficiencies  were  then  compared  with  results  from  the 
two-wave  first-order  coupled-wave  (Kogelnik)  theory 
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and  the  Raman-Nath  theory.  The  regions  of  validity  of 
these  approximate  theories  were  then  delineated. 
Similarly,  the  angular  selectivity  characteristics  were 
calculated  using  rigorous  theory  and  compared  with 
results  from  approximate  theory. 


2.1.  Maximum  Diffraction  Efficiency 

The  maximum  first-order  transmitted  diffraction  ef¬ 
ficiencies  in  percent  for  a  range  of  Bragg  angles  of 
incidence  and  grating  conductivity  amplitudes  are 
presented  numerically  and  graphically  in  Table  1  and 
Fig.  I,  respectively.  The  conductivity  modulation 
amplitude  is  always  equal  to  the  average  conductivity 
value,  as  this  is  necessary  for  maximum  diffraction 
efficiency.  The  wavelength  of  the  incident  light  is 
500  nm,  and  the  grating  period  is  varied  to  keep  the 
angle  of  incidence  always  at  the  first  Bragg  angle 
(m->  1).  The  relative  permittivity  (dielectric  constant) 
both  inside  and  outside  the  grating  is  the  same  in  order 
to  eliminate  the  effects  due  to  discontinuities  in  the 
average  index  of  refraction.  For  near-normal  incidence 
and  tower  values  of  conductivity,  the  maximum  dif¬ 
fraction  efficiency  tends  to  the  value  of  4.80  %  predicted 
by  the  Raman-Nath  multiwave  theory,  which  neglects 
dephasing  For  conditions  of  near-normal  incidence, 
there  are  many  closely  angularly-spaced  propagating 
difTracted  orders  and  dephasing  is  indeed  expected  to 
be  of  minor  importance.  For  larger  Bragg  angles  of 
incidence  and  lower  values  of  conductivity,  the  maxi¬ 
mum  diffraction  efficiency  tends  to  the  value  of  3.70% 
predicted  by  the  Kogelnik  two-wave  first-order  theory. 
For  these  larger  angle*  of  incidence,  the  higher-order 


waves  are  evanescent  and  rigorous  multiwave  theory 
may  be  approximated  in  practice  by  a  two-wave 
calculation. 

A  significant  structural  feature  in  the  resulting  maxi¬ 
mum  diffraction  efficiency  surface  (Fig.  I)  occurs  for 
those  Bragg  angles  of  incidence  at  which  higher-order 
diffracted  waves  are  at  the  transition  from  propa¬ 
gating  to  evanescent  (cut-off).  For  example,  the 
angle*  *in' '(1/9)  a  6.38',  sin ~ '(1/7) i 8.21*.  and 
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Table  l  Example  fundamental  and  hifher-order  diffraction  efficiency  for  a  Bran  anflc  incidence  nMBff-  for  linuioidal  conductivity  palings 
according  to  the  Raman-Nath.  Ko*eloU.  and  ngorous  coupled-wave  theory.  The  firu  cate  le  - 1  mho 'ml  n  m  the  Bragg  regime  (Kogctnih 
theory)  and  the  second  case  <«,  - 10*  ntho/m)  a  in  the  Raman-Natk  regime  Other  parameters  are  a -0.3  pm.  d- 14.33$ pm.  and 

thickness  chosen  ro  mciuiu/c  DE, 


Theory  #,  e.  Diffraction  efficiency  ( \) 


(mho/m] 

DE* 

DE, 

DE, 

DE, 

DE. 

DE, 

Raman-Nath 

1 

(.11  a  to' 

19.# 

4.49 

1.03  x  10" 1 

9.16  »I0-* 

1.73x10-* 

103x10-* 

Kogrtnik 

1 

1  11  >10' 

149 

370 

- 

- 

- 

. 

Rigorous  coupled-wave 

1 

1.12x10' 

14.9 

3-70 

1.36*  Hr* 

5.77  x|0- •• 

604x10'* 

2.2*  xlO  “ 

Raman-Nath 

to* 

(llx|0J 

130 

4.90 

3911  x  |0‘ 1 

1.61  xl0-» 

I27x|0  * 

293  «I0  * 

Kogrlnik 

10* 

1.11  xto-' 

714 

330 

- 

- 

_ 

_ 

Rigorous  coupled-wave 

to* 

(12  xto  1 

•30 

4.80 

391  x|0-‘ 

3.61  xto-* 

117  xlO’* 

1.63  xio-* 

sin"  *(1/5)=;  11.54-'  exhibit  local  diffraction  efficiency 
maxima  in  the  surface  and  correspond  to  transitions 
from  10  to  8  transmitted  propagating  waves.  8  to  6 
waves,  and  6  to  4  waves,  respectively.  For  the  angle 
sin"  '(1/31-19.47-'  and  a  conductivity  of  55.937 mho¬ 
s/m.  the  global  maximum  of  5.260**  occurs  in  the  first- 
order  transmitted  wave  diffraction  efficiency.  This 
angle  marks  the  transition  from  4  forward-diffracted 
waves  to  2  waves  (i»-l  and  +2  become  cut-off). 
Other  transitions,  of  course,  occur  for  specific  angles 
less  than  sin  "  '(1/9).  However,  the  resulting  local  maxi¬ 
ma  are  masked  by  the  overall  Raman-Nath  behavior 
of  the  surface  in  that  region. 


2.2.  Diffraction  Regime « 

The  regime  where  the  two-wave  first-order  theory 
accurately  predicts  the  diffraction  characteristics  is 
often  referred  to  as  the  “Bragg  regime”.  The  region 
where  Raman-Nath  theory  is  accurate  is  called  the 
”Raman-Nath  regime".  These  regions  may  be  distin¬ 
guished  by  a  regime  parameter.  The  conductivity 
grating  regime  parameter  g,  is  defined  as 


(8) 


by  analogy  to  the  regime  parameter  g  for  phase 
gratings  [9-1 1]  which  is 

e  =  Ul/c,A\  (9) 


where  t,  is  the  amplitude  of  the  relative  permittivity 
modulation  of  the  phase  grating.  The  condition  g,—  I 
separates  the  tr-fl,  plane  into  two  regions  as  shown  in 
Fig  I.  For  the  region  of  p,>l,  which  includes  large 
Bragg  angles  of  incidence  (small  grating  periods),  the 
two-wave  first-order  (Kogelnik)  result  as  given  by  (5) 
produces  accurate  results  for  the  fundamental  dif¬ 
fracted  order  (i=  + 1)  for  conductivities  up  to  about 
101  mho/m.  In  the  g,  >  I  regime,  the  transmitted  power 
is  concentrated  primarily  in  the  i»0ind  (-  +  1  orders. 


In  (act,  the  higher-order  diffraction  efficiencies  calcu¬ 
lated  by  rigorous  theory  were  found  to  obey  the 
condition 

I  DE|<I /cl.  (10) 

<•0.  i 

That  is.  the  sum  of  all  of  the  higher-order  diffraction 
efficiencies  is  less  than  I/p*.  This  is  exactly  analogous 
to  the  Bragg  regime  two-wave  criterion  for  phase 
gratings  [II],  Also  for  g,> I,  the  values  of  the  trans¬ 
mitted  wave  (i~0)  efficiency  calculated  by  rigorous 
theory  were  compared  with  the  values  predicted  by  (6) 
from  Kogelnik's  theory.  Good  agreement  was  again 
found  except  at  high  conductivities.  An  example  g.  >  I 
case  showing  this  agreement  is  given  in  Table  2.  Since 
two-wave  first-order  theory  neglects  all  diffracted  or¬ 
ders  except  the  1-0  and  i-  + 1  orders,  there  are  no 
predictions  for  toe  higher-order  waves  using  this 
theory  and  these  are  indicated  by  dashes  in  Table  2. 
For  the  region  of  p,  <1.  the  diffraction  efficiencies  of 
ail  diffracted  orders  (in  addition  to  the  i  =  + 1  order) 
were  calculated  by  rigorous  theory  and  then  compared 
with  the  values  predicted  by  the  Raman-Nath  theory, 
(7),  The  p.<l  regime  includes  near-normal  Bragg 
incidence  (large  grating  periods).  In  this  region  the 
Raman-Nath  formula  as  given  by  (7)  was  found  to 
produce  accurate  results  for  conductivities  up  to  about 
5  *  10*  mho/m.  This  clox  agreement  for  the  first-order 
diffracted  wave  is  apparent  in  Table  I  and  Fig  1  For 
the  zero-order  and  higher-order  diffraction  efficiencies, 
similar  good  agreement  was  found.  A  single  typical 
p,<  I  cast  showing  the  agreement  with  Raman-Nath 
theory  is  Included  in  Table  2. 


2.3.  Angular  Selectivity 

A  Bragg  condition  occurs  whenever  m  in  (4)  is  an 
integer.  Dephasing  from  the  Bragg  condition  may  be 
produced  for  a  fixed  grating  by  changing  the  angle  of 
incidence  and/or  the  wavelength.  For  m»  I.  it  is  the 
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first  or  fimdsmenul  Brags  incidence.  In  this  case,  there 
is  efficient  power  transfer  from  the  incident  wave  to  the 
i m  + 1  diffracted  order.  Mathematically,  this  is  due  to 
the  foctoc  (as— i)  being  zero  in  the  rigorous  coupled- 
wave  equations,  (3j  This  Sf,i)  term  in  the  rigorous 
coupled-wave  equations  represents  dephasing  from  the 
Bragg  condition.  When  it  is  zero,  there  is  no  dephasing 
and  Bragg  incidence  occurs.  The  two-wave  first-order 
coupled-wave  analysis  of  Kogelnik  retains  the  effects 
of  dephasing  from  the  Bragg  condition.  The  Raman- 
Nath  theory  neglects  this  term  entirely,  and  any  angle 
of  incidence  and  wavelength  is  treated  as  Bragg 
incidence. 

The  angular  selectivity  of  a  grating  is  a  measure  of  the 
sensitivity  of  the  diffraction  to  changes  in  the  angle  of 
incidence.  The  angular  selectivity,  AO,  may  be  defined 
as  the  full  angular  deviation  about  the  first  Bragg  angle 
(m«l)  which  causes  a  reduction  in  the  diffraction 
efficiency  to  one  half  the  value  at  the  Bragg  angle.  This 
angular  selectivity  may  be  calculated  from  rigorous 
coupled-wave  theory  or  from  approximate  two-wave 
first-order  coupled-wave  theory  since  these  theories 
include  dephasing  effects.  The  angular  selectivity  is 
given  by 

=  (II) 

where  0*  and  0~  are  the  angles  of  incidence,  greater 
than  and  less  than  the  Bragg  angle,  respectively,  at 
which  the  diffraction  efficiency  has  dropped  to  one  half 
of  the  value  at  the  Bragg  angle.  From  two-wave  first- 
order  (Kogelnik)  theory,  these  quantities  are  given  by 

e  -sm  i  wSF - I 

(12) 

The  quantity  J  is  a  dephasing  parameter.  If  {  -0,  there 
is  no  dephasing  and  01  indicating  d6« 0  (in¬ 
cidence  at  Bragg  angle).  For  the  maximum  efficiency 
1/27)  in  this  theory,  it  is  {•=0.8932.  The 
angular  selectivity  may  not  be  calculated  from  Raman- 
Nath  theory  since  that  theory  does  not  include  any 
dephasing  effects. 

A  comparison  of  some  angular  selectivity  results  from 
rigorous  theory  and  from  Kogelnik  theory  are  shown 
in  Table  3.  In  each  case  the  first  Bragg  angle  8,-30°, 
the  wavelength  A»500nm,  and  the  grating  is  fully 
modulated  o0.  For  each  conductivity,  the  thick¬ 
ness  that  maximizes  the  first  diffracted  order  power  is 
used.  For  relatively  thick  gratings,  the  rigorous  theory 
and  the  Kogelnik  theory  predict  the  same  angular 
sensitivities.  For  high  conductivity  thin  gratings,  the 
angular  selectivity,  AO,  approaches  approximately 
80°.  However,  approximate  two-wave  first-order 
(Kogelnik)  theory.  (12),  predicts  that  the  angular  selec¬ 
tivity  approaches  180’  in  the  limit  of  increasing 
conductivity. 


Tsble  ).  Ang  uUr  selectivity  for  sinusoidal  conductive  frump  The 
full  angular  devialioa  about  the  first  Bragg  angle.  40.  that  causes  a 
reduction  ia  the  diflractmu  efficiency  to  one  half  of  the  value  at  the 
Bragg  ingle  is  fivea  Both  the  epproumaie  value  of  40  from  Kogd- 
nik's  two-wave  first -Older  coupied-uavc  theory  end  the  value  from 
the  present  rigorous  theory  are  shown.  In  each  case  0,-30. 
i  -  500  nm,  and  Or  grating  is  fully  modulated.  The  indices  of  refrac¬ 
tion  inside  and  outside  of  the  grating  are  equal 


0 

[mho/m} 

i 

t— I 

40  (degrees] 

Kofcliuk's 

theory 

Rigorous 

theory 

io-* 

5.05*10' 

5.01  *10* 

5  .08  *10  * 

1 

505 

508  *I0'» 

508*10-* 

20 

SOS  « IO-1 

508  *10'* 

508*10-* 

10* 

505*10-* 

5.08*10-' 

5  08*10  ' 

10* 

505*10-* 

5.07 

508 

10° 

5.1**  IO-* 

4.t»*IO' 

5.06  *  10* 

10* 

1X5*10  * 

137x10* 

7  83  *10' 

10* 

107*10-* 

1.75*10* 

7  79*10' 

to’ 

t.«*io-* 

1  7**10* 

7.14*10' 

3.  Summary  and  Discussion 

The  rigorous  coupled-wave  equations  for  (col- 
sinusoidal  conductivity  (absorption)  gratings  have 
been  presented.  These  were  then  solved  subject  to  the 
appropriate  electromagnetic  boundary  conditions  for 
the  first-order  and  higher-order  transmitted  diffraction 
efficiencies  for  (he  entire  range  of  possible  conducti¬ 
vities  and  first  Bragg  angles  of  incidence  (equivalent  to 
the  range  of  possible  grating  periods)  These  results 
were  then  compared  to  results  from  the  Raman-Nath 
and  two-wave  first-order  (Kogelnik)  approximate 
theories.  Example  results  are  shown  in  Table  2.  The 
global  maximum  diffraction  efficiency  for  the  first- 
order  transmitted  diffracted  wave  was  found  to  be 
3.26%  rather  than  4.80%  or  3.70%  predicted  respective¬ 
ly  by  the  Raman-Nath  and  Kogelnik  approximate 
theories. 

A  conductivity  grating  regime  parameter  was  defined 
as  e,»4xit/i|0e,dI  by  analogy  to  the  phase  grating 
regime  parameter  [9- II].  The  condition  f,*l  was 
shown  to  delineate  Raman-Nath  diffraction  behavior 
(e,  <  1)  and  two-wave  first-order  (Kogelnik)  diffraction 
behavior  (e,>l)  For  sufficiently  high  conductivities 
(about  3  x  104  mho/m  for  Raman-Nath  theory  and 
about  1 03  mho/m  for  Kogelnik  theory),  it  was  shown 
that  these  approximate  theories  no  longer  give  ac¬ 
curate  results  even  though  the  regime  parameter  con¬ 
dition  is  met  (Fig.  1). 

The  angular  selectivity  characteristics  of  these  planar 
conductivity  gratings  were  analyzed  using  rigorous 
coupled-wave  theory.  Two-wave  first-order  approxi¬ 
mate  theory  was  found  to  give  accurate  predictions  for 
conductivities  up  to  about  IQ4 mho/m,  but  overesti- 
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meted  the  angular  selectivity  for  higher  con¬ 
ductivities. 

H-tnode  polarization  (electric  field  perpendicular  to 
the  plane  of  incidence  and  perpendicular  to  the  grating 
vector)  has  been  analyzed.  However,  E-tnode  polariza¬ 
tion  may  be  treated  in  the  tame  manner  by  starting 
with  the  E-mode  coupled-wave  equations,  as  shown  in 
[12]. 
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Sympotluai  oa  DUTiutlvi  Optical  Element* 

Invited  Paper* 

TUI.  MUhUw  la  Ike  Evolution  ef  Diffractive  Ordeal  Element*.  A.  w  bOHMANN.  Phytiheliache*  fiutilut 
drr  Uniuersitil  Erlcngen-Numkerg,  Eruin-RommeUStrasse  I,  8520  Erlangen,  Weil  Germany  —  By  mu*  of  dif¬ 
fraction  wu  are  able  to  modify  the  direction,  the  amplitude,  the  phae*.  the  polariaation.  the  shape  ef  a  wavttrain.  the 
three-dimemioual  distribution  of  enetyy  and  efliyhl  pressure,  and  the  spectral  composition  of  a  liyht  beam.  Device* 
that  porfona  such  modification*  an  called  diffractive  optical  element*  (DOKV.  W«  briefly  review  the  history  of  tome 
DOE'*.  euch  at  patiof*.  too*  plate*,  epatial  fiKeta,  three-dimensional  DOE's,  and  holoyrara*.  The  emphasis  it  on 
modem  vtieiont  of  DOE'a.  Meet  of  the  applications  an  dealt  trith  ia  the  companion  lecture.  (23mm.) 

ThAZ.  Application*  ef  Diffractive  Optical  Element*.  H.  1.  CAULFIELD.  Applied  Science*  Division,  Aerodyne 
Research,  Inc,  45  Manning  Hoad.  Billerica,  Massachusetts  01821.— Diffractive  optical  elemtata (DOE'tl  art  proviny 
to  he  of  aiynificant  value  in  a  and*  variety  of  field*.  The  formation  of  hi*  h-yuality  diffractive  optical  element*  it  be- 
cominy  a  matter  of  routine  lechnoloyy  in  eeveral  placet  around  the  world.  The  firtt  leak  of  e  DOE  detiyner  ie  la  be  cer¬ 
tain  that  the  deaifn  doe*  not  violate  any  of  the  phyeieal  thtrmodyramic  lew*.  Some  of  the**  lean  are  well  known  in 
optica  in  (eneral.  other*  eeem  peculiar  to  DOE  *  The  new  law*  have  to  do  with  the  mutual  effect*  of  tptctral  and  spa- 
lial  coherence  in  both  recordinf  indue*  of  DOEi.  Among  choice*  available  to  the  DOE  detiyner  art  activity  (ptaeive. 
active,  volatile),  dimentionality  (bulk,  inteyrated),  and  interaction  mod*  (reflective,  transmissive,  both).  We  turvey 
applicationa  by  esnsideriny  what  claaaical  optical  element*  can  be  replaced  by  DOE'*.  A  partial  liat  include*  diffrac¬ 
tion  fratinfi.  prism*.  beam  aplitierv  pinhole  Til  ten,  dicbroic  fthera,  wavtieayth  multiplaaer*  and  demultipleiera,  fiber 
coupler*.  *ource-fib*r  and  fiber-detector  coupler*,  scanners.  minor*,  retroreflectora,  inteyrated  optical  input-output 
coupler*,  laaert.  multiple  imsfiny.  and  unayt  conversion.  Each  ef  that*  ia  diacuaaed  a*  an  iUuatralion  of  the  |  eneral 
principle*  deecribed  earlier.  (13  min  i 
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TEAS.  Diffraction  of  Finite  Beam*  by  Dielectric  Gratiays.  M. 
O.  MOHARAM  AND  T.  K.  GAYLORD,  School  o/  Electrical  Engineering. 
Georgia  Institute  of  Technology.  Atlanta.  Georgia  30332— Elec- 
tromaynetic  diffraction  by  dielectric  yretinia  hat  been  erteruively 
treated  in  the  literature.  However,  moat  theoretical  inveetiyation* 
apply  to  diffraction  of  infinite  plane  wave*  rather  than  to  realiatic 
Unite  haama  In  thia  work,  the  diffraction  offuiite  beam*  by  dielectric 
yratiny*  baa  been  analyud  uainy  riyoroua  coupled-wave  analyti* 
OMabinad  with  a  plane-wav*  euperpoaitian  reprceenUlion  of  the  finite 
beam.  Riyoroua  calculation*  for  the  diffraction  charecteriatica  of 
Gauaaian  beama  incident  upon  planar  and  rectanyular  yrooved  di¬ 
electric  yratiny*  have  been  performed.  The  diffraction  efficiency 
alony  with  the  near- and  far-field  profile*  of  the  diffracted  beama  are 
praarntad  It  it  ahown  that  the  product  of  the  ratio  of  the  beam  width 
to  the  liyht  waveleryth  and  the  tnyular  width  of  the  anyular  aelectivity 
char  arte  riatic  of  tha'yratiny  (for  a  plan*  wave)  play  critical  role*  in 
the  diffraction  characteristic*.  for  wide  beam*  and/or  brood  anyular 
aelectivity  characteriatica  (multiwave  diffraction  reyimel  (he  dif¬ 
fraction  praceai.  in  the  near  field,  ie  eaten tially  the  diffraction  of  a 
plane  wnv*  with  the  diffracted  order*  multiplied  by  the  incident  beam 
profile.  For  extremely  narrow  beam*  and/or  narrow  anyular  aelec¬ 


tivity  c bar* etcria tics  (ttrony  Brayy  diffraction  reyimel  riyoroua  cal- 
culetioo*  art  needed.  (13  min.) 

ThAI.  Diffraction  Charaeteriatkeef Surface-Relief Dielaetric 
Cratinya.  M  C  mokaram. T. K  Gaylord, c  T. sincerbox- h. 
enOUJCH.*  AND  B.  YUNG.*  School  of  Electrical  Engineering.  Georyw 
Institute  o/  Technology.  Atlanta.  Georgia  30332— Surface-relief 
dielectric  yratiny*  with  aribtrary  profile*  are  of  wide  interest  owiny 
to  their  many  application*  in  quantum  electronic*,  inteyrated  opuc*. 
boloyraphic  optical  element*,  and  *p*etroacopy.  Recently  aurface- 
relief  yratiny*  with  arbitrary  profile*  have  been  analysed  uainy  a 
riyoroua  raupled-wavt  approach,  b  thia  work,  riyoroue  calculations 
for  the  diffraction  e/fioeney  and  wevolenylh  and  anyular  eelectivitie* 
for  TE  and  TM  polarization*  are  presented  for  several  profile*,  in- 
'  cludiny  rectanyular.  sinusoid  cl,  sawtooth,  and  profiles  fabricated  by 
IBM.  The  dependence  of  the  ciectreaaynetic  diffraction  process  on 
the  various  parameters  involved  ia  presented  also.  Diffraction  of  fi¬ 
sh*  beams  by  aurfsce-relief  dielectric  yratiny*  ia  diacuaaed  briefly. 
(13  min.) 

‘  IBM,  Sta  feet,  California. 
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mi  m  a  point  nenhnaarity  to  Implement » tno  tfimonainnil  array 
of  independent  logic  (NOR)  gataa.  and  one  or  man  hologram*,  used 
te  interconnect  the  gates.  la  tlia  pa^tr  ««  dwribt  nquinwou 
on  tha  SLM  to  auun  proper  functioning  of  tin  optical  logic  system, 
i«ti«(  into  account  tow  of  Um  tbuKinatia  peculiar  to  SLMi 
The  traiwfet  characterietic  onset  be  negative -going  (inverting)  and 
(bistability  ie  not  aacaaaary).  Te  ensure  regeneration  of 
the  signal  lava)  at  each  paaa  through  a  gala,  the  Uanefaa  chaiactaaiatic 
must  ashibit  gain,  saturation.  and  a  threshold.  These  three  rharac- 
tanatka  aleo  affect  the  noise  immunity,  the  cross- talk  tolerance,  and 
the  gate  danaity.  The  ultimata  limit  on  the  fen -out  of  each  gale  ia 
determined  by  the  output  control  ratio.  The  uniformity  of  the 
tranefer  characterietic  over  the  SLM  active  area  it  alao  oonaidered. 
It  ho  an  effect  an  the  yield  of  the  daaign  and  layout  of  a  large  circuit 
or  proceoor,  on  the  predicted  error  rale,  and  on  the  above  require- 
menu  o  well.  (13  min.) 

ThBS.  Optical  Truth-Table  Look-Up  Troceeaiag  of  Digital 
Date.  M.  M.  MIRSALEHI.  C  C  GUEST.  AND  T.  K  GAYLORD.  School 
of  Electrical  Engineering.  Crorgio  hulitute  of  Technology.  Atlanta, 
Georgia  30332  —The  detector  threahold  retting  and  prohabilitiet  of 
error  that  are  due  to  recording  amplitude  and  phaae  variation!  in  the 
operation  of  a  holographic  eontent-ad  Jreaaable  memory  ayilem  for 
tnrth-Ublr  look-up  digital  dau  proceaaing  have  been  analyzed  The 
effecU  of  Ceuaeian  distributions  in  the  amplitude  and  phaae  of  the 
record u«  beams  have  been  treated  The  operations  of  4  . 6-,  12-,  and 
16-hit  addition  and  multiplicatiun  have  been  analyred  for  binary- 
coded  reeidue  numbers.  Setting  each  detector  thrtahold  to  iu  opti¬ 
mum  value  (separate  threahold  tatting  technique)  or  tatting  all  de¬ 
lector  thresholds  vo  a  tingle  optimum  value  (common  threshold  set¬ 
ting  technique)  it  found  to  produce  similar  probabilities  of  trror. 
Optimum  detector  threshold  amplitude  settings  are  typically  about 
63%  of  the  amplitude  of  a  component  wave  corresponding  to  a  tingle 
bit  For  practically  achievable  standard  deviations  in  the  amplitude 
and  phaae  about  the  design  values,  the  resultant  probabilities  of  error 
are  generally  lese  than  10_‘.  Using  logically  reduced  versions  of  the 
truth  tablet  typically  results  in  a  significant  decrease  in  both  the 
probebility  of  error  and  the  number  of  required  holograms.  (13 
min.) 

ThHS.  Optica)  Methods  for  BandsrMlb  Compression  of  Large 
Matrices.  JOHN  N.  LEE  AND  RAVINDRA  a.  ATHAJX.  Code  6330.  V.S. 
Naual  Research  Laboratory,  Washington,  D  C.  20373  —  It  it  desirable 


Is  reduce  the  bandwidth  of  a  large  matrix  before  pro  turning  it  further. 
Out  approach  is  to  perform  a  unitary  transformation  on  the  matrix 
to  obtain  a  sparse  matrix  a  ■  VrGV.  when  U  and  V  art  known  ma¬ 
trices  consisting  of  otthooocmal  columns.  The  tofutioo  of  this 
aquation  ia  computation  intensive  for  large  G.  The  triple. mains 
product  oa  the  right  in  this  equation  can  be  tffkientiy  implemented 
using  one-dimensional  modulators,  such  at  eeoueto-optic  ceils  oe 
a  lectio -optic  modulators,  and  a  two-dimensional  spatial  light  mod¬ 
ulator.  The  neullant  optics!  processors  avoid  output  bottlenecks 
associated  with  pbotodeteetor  readout  by  generating  only  nonsero 
elements  of  a.  Two  different  architectures  have  been  developed:  a 
space-integrating  architecture,  in  which  the  desired  o„  era  output 
fat  a  temporal  ■  fleam  from  a  tingle-element  detector,  and  a  hybrid 
time  apace  integrating  architecture,  in  which  the  desired  ov  accu¬ 
mulate  over  a  linear  time  integrating  pbotodeteetor  array.  (13 
min.) 

ThH7.  Frequency*  Do  mala  Optical  Storage  Using  Spectral 
Bole  Burning,  w  e  moermeb.  m  d  levenson.  r  pokrowsky. 
AND  C.  C.  BLIORKLUKD.  Kf 6/282,  IBM  Research  Laboratory,  36 00 
Cottle  Road,  San  Joee,  California  93193 — The  phenomenon  of 
persistent  spectral  hole  burning  allows  optical  frequency  to  be  utilised 
is  an  additional  dimension  for  the  organisation  of  optical  metcories1 
and  offers  the  potential  for  achieving  e  I  O’  gain  in  achievable  storage 
density  over  conventional  optical  storage.  The  spectral  holes  art 
produced  by  narrow-band  laser  radiation  within  an  inhornogeneoualy 
broadened  sero-phonon  absorption  tins  of  a  photoactive  species 
contained  in  s  solid  host  material  at  cryogenic  temperatures.  Typical 
hole  widths  art  about  100  MHi,  and  typical  inhomogeneous  and  tha 
err  200  GHi.  ao  10*  bits  can  be  stored  in  a  cubic  wavelength  of  material 
by  burning  or  not  burning  holts  at  various  frequency  locations  within 
the  line.  Recent  reauha  on  CaAlAs  laser -compatible  materials1  and 
on  materials  with  in  homogeneous- hooxigrneous  linewidth  ratios  of 
greater  than  2  X  10*  are  presented.  In  addition,  recently  developed 
later  spectroscopic  techniques  for  rapid  reading  and  writing  of  holts 
an  discussed  ’  Systems  concepts  end  materials  requirements  for 
practical  applications  an  elucidated. 

1 G.  Cssuoet  el.  US.  Patent  N»  4.10IJ7C  (197»);  D.  Hsaiat.Soe.  PSoueOpc 
la.uvun.Eng  m.»!ll»1Bt 

*  W.  E.  Meeraer.  F.  M  ScSe  lien  berg,  eat  G.  C.  Bjorkluad.  Appl  Pkyt  B 11 
283(1982). 

>  W.  1  sngdt.  C.  Orris,  ted  G  C.  BjorkW.  Opt  Lett  4.  Ml  II981I:  V.  C. 
Bjorkluad.  W.  Lenlh.  and  C.  Orris.  See.  Pbota-Opl-  Inslrum.  Engl  hi,  107 
(IBBU;  P.  Foktowaky  el  tl .  Opt.  Lett.  (.11011*831. 
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Thll.  Multiple  Scattering  Effects  la  Trasamimornttcr  and 
Lldar  CeaffgunUoat.  A.  zardecki.  s  a.  w.  gerstl.  and  R.  C. 
SHIRKEY.*  Theoretical  Division.  Los  Atamae  National  Laboratory, 
MS  B279.  Lot  Alamat,  New  Mexico B7343.— For  a  laser  beam  prop- 
egeting  in  a  turbid  medium,  we  analyra  the  multiple  scattering  cor- 
ractiona  to  the  Beer-Lambtrl  law  in  the  case  of  a  detector  with  a 
variablt  field  of  view.  A  previoue  work  of  7ardecki  and  Tam1  is  ex¬ 
tended  toe  a  finite  slab  geometry  aad  aleo  for  the  ceaei  of  converging 
and  Ohrerging  beams,  discussed  by  Jensen .*  A  versatile  computer 
code,  based  on  tha  well-known  Dolin-Fante  approach  to  multiple 
eeettering  ie  developed  The  bee kaca tiering  from  randomly  dis¬ 
tributed  acatterara  ia  included  in  the  theory  by  solving,  to  tne  first 
•pproaimation,  a  coupled  system  of  equations  for  the  forward  and 
backward  specific  intensities.  This  ia  equivalent  to  the  rumulalive 
forward  .scatter  ainglebackscattcr  approtimetion  of  da  Wolf.  The 


multi  of  numerical  modeling  for  laser  beams  scattered  from  fogs, 
clouds.  And  other  eerooda  are  gives.  (13  min.) 

*  US.  Army  Atmospheric  Science*  Laboratory. 

1  A.  Zatdocki  and  W.  C.  Tam.  Appl.  Opt  21. 2*13  (1982) 

>R  E  Jesses,  J.  Opt  Sec.  Am.  24,  IM7  (19801 

ThlL  Treaamlaaieo  pf  Laser  Radiation  la  Rais.  L  w  WIN¬ 
CHESTER.  JR..  Krwrrnstr  Rrsearrh  Center,  Michigan  Technological 
Unioenity .  Houghton,  Michigan  49931  —The  tranemittancr  of 
0.6328-,  1 .064-,  and  10  591 -pm  laser  rsdistion  in  rain  ana  measured 
aa  a  function  of  rain  rale  over  a  folded  horizontal  path  of  length  I  km. 
A  model  based  on  tea  tiering  theory  has  been  developed  to  compute 
tnnamillanc*  as  a  function  of  rain  rate  and  the  raindrop  tiaedistri- 
bution.  The  modal  computes  the  contribution  of  aero-,  first-,  and 
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TuBl  ItitwhH  Btlimitl  Galatia  Wm  l»  IhI-TIw 
Hakgraphl*  Canflguratkn*.  anew  CAUXTO  and  R  A.  US- 
MAD.  Department  of  Physics,  Laval  University,  Stt-Foy.  Qurhe c 
OIK  JPt, Canada.  -Undeveloped  dichrcmatad  gelatin  (DCC)  film* 
km baan utilised is re*)-tim* holographic configuration*.  Thalow 
aansilivityc/DCG  to  red  light  (h,  ■  0.L1JJ  am)  is  the  basi*  of  e  pro¬ 
ems  to  recooetnict  ■  hologram  at  th*  mom  time  that  th*  mtarfarene* 
pattern.  formed  with  (mo  tight  (X,  -  0.M43  pm),  it  recorded.  Ths 
Ubsvior  o i  th*  diffraction  efficiency  for  transmission  gratings  a*  a 
Auction  of  thaaxpoaur*  ha*  batn  studied.  Application*  in  mil- 
and  double-expoaura  ra*!-tim#  holography  have  been  med*.  Th ass 
include  Lb*  making  of  Fourier  holograms,  doubie -exposure  interfar- 
agram*  of  phaa*  object*.  optical  difttrantiation,  and  character  rec¬ 
ognition.  (.  3  min.) 

TaB4.  Nonunlfor*  Bra((  HoJoframa.' T  JANNSON.  H.  TIN.  A»n> 
H.  A.  YU. Division  of  Reteorch,  NuUinal  Technical  Systems,  Inc., 
12511  Beatrice  Street ,  Lns  Angeles ,  California  90066  — Significant 
progress  in  th*  technology  of  dichromated  c  r  la  tin  hologram*  ha*  it  ■ 
cantly  stimulated  severs!  research  froup*1  to  iludy  holographic 
concentrator*,  it.,  optical  holographic  olamcnta  focusing  radiation 
spectrum.  On*  of  the  basic  problems  m  the  CM*  of  photovoltaic,  solar 
boloconctnuetora1  is  optimisation  of  dilfraction  efficiency  charac- 
taristica  to  avoid  cross-coupling  effects  in  oultiespoaure  cases.  This 
problem,  which  is  strongly  related  to  physics  snd  photochemistry  of 
dichromated  gslatin  holograms,  can  b*  solved  by  specie  muhilaysr 
technology  that  :ndudcs  controlled  nonunifonn  shrinkage  and 
swelling  effect*  during  holographic  processing.  Thus,  from  a  theo¬ 
retical  point  of  view,  Kogelnik'*  fully  uniform  coupled-wave  theory 
is  not  enough  to  describe  these  phenomena,  ant]  some  new  model*, 
including  uotninifonn  distribution  of  grating  vector  and/or  coupling 
constant  (even  for  elementary  holographic  grating)  in  the  direction 
perpendicular  to  ths  hologram  surface,  hev*  to  be  proposed.  In  such 
cases,  the  grating  vector  uncertainty  relation'  is  especially  useful  In 
this  work  ere  introduce  s  generalised  WKB  method  to  describe  non- 
uniform  volume  holograms  with  high  diffraction  efficiency.  Both 
transmission  and  reflection  geometries  ere  considered,  although  the 
quaai-Lippman  geometry  seems  to  be  the  most  interesting  It  is  an 
attractive  feature  of  this  method  that,  for  a  large  majority  of  typical 
cases  of  noauniformity.  we  obtain  th*  diffraction  efficiency  charac¬ 
teristics  in  analytic  form,  independent  of  the  particular  shape  of  the 
nonunifermity.  Additionally,  experimental  data  supporting  such 
theoretical  models  are  presented  (13  min.) 

*  This  werk  has  been  supported  by  the  US.  Department  of  Energy,  contract 
as  DE-AC0341ER1W.-M. 

■  W.  H.  Bless.  M  Gneeinger.  end  E.  R  Rsinbard.  Appi.  Opt  II.  173*  (1942). 

•  T.  Jaasaoo.  J.  Opt  Sue.  Am  M.  342 11S81  > 


TuHL  Valnma  Diffract***  by  Saparlwpaaad  Grating*.  i  w. 
LCWU  AND  L.  K7LYMAA.  Department  uf  Engineering  Science, 
University  sf  Oxford.  Pa.  he  Road,  Oxford  0X1 SPJ.  England  —As 
«n  attanaioe  of  the  one  dimensional  thaory  of  volume  diftrectioB.  a 
continuous  monochromatic  spectrum  of  plan*  wav**  incident  upon 
a  uniform  sinusoidal  grating  may  be  considered.  Sines  any  grating 
can  be  rag  sided  as  s  superposition  of  sinusoidal  gratings,  the  genera) 
problem  to  solve  is  an  infinite  number  of  plan*  wave*  incident  upon 
an  infinite  number  of  gratings.  In  a  low -efficiency  approximation 
ths  effacl  of  aach  grating  may  be  coneiderad  separately,  and  the  total 
diffracted  field  may  b*  obtained  by  summing  th*  individual  contri¬ 
bution a.  However,  in  th*  high -efficiency  case,  when  the  incident 
beam  M  depleted,  th*  interaction  among  the  various  grating*  cannot 
ho  neglected,  and  a  new  farmulslkm  is  racuiirad.  Tb*  analysis  pre¬ 
sented  ■  restricted  to  tlw  two-dimensional  scalar  cnee.  Gratings  may 
ha  specified  dinctly  or  a*  the  result  of  photographic  raoordirgs  of  an 
interference  pattern  in  an  absorbing  medium.  The  electric  field  in 
the  grating  is  described  in  terms  of  an  integro-differantial  equation. 
In  simple  n—  of  single  sinusoidal  gratings,  well-known  coupled -wove 
aquation*  may  be  derived  from  this  equation.  Fc*  more  ganeral  cases 
numerical  solutions  (necessitating  the  use  of  discrete  tpactra)  are 
obtained.  (13  min.) 

faBI.  Paper  withdrawn. 

TuH7.  Diffraction  Efficiencies  of  Transmission  Absorption 
Grating*,  w. t  Baird. t.k  Gaylord. and v. g  moharam. School 
Of  Electrical  Engineering,  Georgia  Institute  of  Technology.  Atlanta. 
Georgia  3033 2  — The  diffraction  characteristic*  oT  planar  (co)ei- 
nusoidal  conductivity  (absorption)  transmission  gratings  an  deter¬ 
mined  using  rigorous  coupled-wave  theory.  The  first-order  end 
higher-order  diffraction  efnd'Dciee  at*  determined  over  the  entire 
range  of  possible  conductivities  and  Bragg  angles  of  incidence  (or 
equivalently,  grating  periods)  foe  If -mode  polarisation  incident  plane 
waves.  Rigorom  result*  are  compared  enth  approximate  leeuhs  from 
th*  RameivNath  theory  and  the  two-wsve  first-order  coupled-wave 
(Kogelnik)  theory.  The  global  maximum  diffraction  efficiency  lor 
the  first -order  transmitted  diffracted  wav*  was  found  to  be  3.26V 
rather  than  the  4.80*  or  3.70%  predicted,  respectively,  by  the 
Ramar-Nath  and  Kogelnik  approximate  theories.  A  regime  pa¬ 
rameter  ia  defined  that  delineate*  th*  region  of  Raman-Nsth  dif¬ 
fraction  behavior  and  the  region  of  two-ware  first-order  diffraction- 
theory  behavior.  Likewise,  the  angular  selectivity  characteristics  of 
conductivity  gratings  era  determined  from  rigorous  theory  and  are 
compared  with  corresponding  results  from  approximate  theory.  (13 
min.) 

TuHA.  Paper  withdrawn. 
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Residue  number  system  holographic  truth-table  look-up 
processing:  detector  threshold  setting  and 
probability  of  error  due  to  amplitude  and 
phase  variations 

M.  M.  Mirsalehi,  C.  C.  Guest,  and  T.  K.  Gaylord 


4.:  ■ 


The  UM  of  *  holographic  eontent-xddreeaeble  memory  system  for  parallel  truth-table  look-up  digital  data 
procesaing  is  analysed.  For  binary -coded  residue  numbers,  the  operations  of  4-.  8-,  I2-.  and  16-bit  addition 
and  multiplication  are  treated.  The  minimum  probability  of  error  that  ran  he  achieved  and  the  correspond¬ 
ing  detector  threshold  settings  are  determined  in  eech  case  aJknving  for  the  effecta  of  Gaussian  distributions 
in  the  amplitude  and  the  phase  in  the  recording  beams.  Resultant  probabilities  of  error  foe  practical  condi¬ 
tions  are  found  to  be  very  competitive  with  those  from  state-of-the-art  nonparalirl  technologies. 


L  Introduction 

A.  Truth-Table  Look-Up  Processing 

Most  functions,  transformations  and  operations  may 
be  represented  by  a  binary  truth-table  in  which  the 
outputs  for  all  possible  input  combinations  are  given. 
Direct  implementation  of  truth-table  look-up  processors 
was  previously  uncommon  in  data  and  signal  processing. 
However,  in  recent  years  it  has  become  steadily  more 
commonplace.  There  are  three  general  methods  used 
for  direct  implementation  of  a  truth-table.  These 
are: 

(X)  Location-Addressable  Memory.  In  this  type  of 
truth-table  look-up  processor,  the  entire  truth-table  is 
stored  in  a  direct  or  location-addressable  memory  such 
as  an  electronic  read-only  memory.  This  is  a  straight¬ 
forward  implementation,  but  it  is  very  inefficient  in 
terms  of  required  storage. 

(2)  Hardware  Logic  Gates.  A  truth-table  may  be 
implemented  directly  without  any  memory  look-up 
through  the  use  of  Boolean  logic  gales.  Each  binary 
output  variable  when  represented  as  a  sum  of  products 
(or  product  of  sums)  of  binary  input  variables  may  be 
implemented  with  three  levels  of  logic  in  the  form  of  a 


Th*  authors  art  with  Georgia  Imtitut*  of  Technology,  School  of 
ElectrireJ  Engineering.  Atlanta,  Ceorg  i*  303.12. 
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programmable  logic  array  (PLA).  For  a  sum-of-prod- 
ucts  form,  the  sequence  of  logic  gates  is  NOT.  AND.  OR. 
Electronic  integrated  circuit  implementations  of  PLAs 
ate  commonly  achieved  with  large-scale  integration 
(LSD  and  very  large-scale  integration  (VLSD. 

(3)  Content -Addressable  Memory.  A  truth-table 
look-up  processor  may  be  implemented  using  a  con¬ 
tent-addressable  memory.  In  these  systems,  for  each 
output  bit,  the  combinations  of  inputs  are  stored  that 
cause  this  output  bit  to  be  a  logical  one.  The  inputs  are 
compared  with  the  stored  tables,  and  detected  matches 
cause  the  appropriate  output  bits  to  be  logical  ones. 
The  sizes  of  the  stored  truth  tables  may  typically  be 
greatly  reduced  using  methods  of  logical  reduction  such 
as  the  Quine- McCluskey  method1  or  the  Tison  algo¬ 
rithm.2  Content-addressable  memories  have  been 
technologically  difficult  to  construct  However,  optical 
holographic  systems  are  natural  content-addressable 
memories,  and  this  optical  technology  appears  to  be 
very  promising  for  this  type  of  application.3  Optical 
holographic  content  -addressable-memory  truth-table 
look-up  processing  is  analyzed  in  this  paper. 

B.  Residue  Number  System 

The  use  of  residue  arithmetic  in  computing  has  been 
extensively  studied  over  many  years.4-4  Residue 
arithmetic  has  a  number  of  well-known  advantageous 
features.  Residue  arithmetic  has  recently  been  shown 
to  be  extremely  efficient  when  applied  to  content- 
addressable-memory  truth-table  look-up  processing.3 
In  residue  arithmetic,  the  calculations  associated  with 
each  modulus  are  independent  of  the  calculations  as¬ 
sociated  with  the  other  moduli.  For  example,  there  are 
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bo  input  or  output  ctniM  between  digit*  in  residue 
arithmetic.  In  mm  nee,  thi*  mean*  that  there  i*  one 
(relatively  tmall)  truth  table  aaaociated  with  each 
modulus  uaed.  The  range  of  number*  represented, 
which  ia  equal  to  the  product  of  the  relatively  prime 
moduli,  may  be  increased  by  aimply  including  addi¬ 
tional  moduli  (and  their  aaaociated  truth  tables).  The 
residue  number*  associated  with  each  modulus  may  be 
encoded  in  binary  fora.  This  produces  a  digital  (rather 
than  analog)  processor  with  its  many  accompanying 
well-known  advantages.  This  binary-coded  represen¬ 
tation  of  residue  numbers  is  used  in  this  paper. 

C.  Residue  Number  System  Holographic  Truth-Table 
Look-Up  Processing 

An  optical  holographic  system  functioning  as  a  con¬ 
tent-addressable  memory  to  implement  a  truth-table 
look-up  processor  that  operates  on  binary -coded  residue 
numbers  is  analyzed  in  this  paper.  The  operations 
treated  are  addition  and  multiplication.  These  are  the 
basic  operations  required  to  implement  most  digital 
processing  algorithms  such  as  the  discrete  Fourier 
transform,  1-D  and  2-D  digital  filtering,  matrix  ma¬ 
nipulations.  power  series  evaluations,  etc.  To  analyze 
the  complexity,  functioning,  and  performance  of  a  res¬ 
idue  number  system  holographic  truth-table  look-up 
processor  to  perform  addition  and  multiplication  for 
given  word  lengths,  a  number  of  pieces  of  information 
are  needed.  These  include 

(1)  the  optimal  set  of  moduli  to  use; 

(2)  the  sizes  (memory  requirement)  of  the  truth  ta¬ 
bles  to  be  used; 

(3)  the  optimal  threshold  seUing(s)  for  the  detectors; 
and 

(4)  the  total  probability  of  error  in  performing  an 
operation  considering  the  optical  wave  amplitude  and 
phase  variations  during  recording.  (During  readout 
processing,  only  a  single  incident  wave  is  needed,  and 
to  the  system  operation  is  insensitive  to  the  amplitude 
and  phase  variations  in  the  reconstruction  wave.) 

The  first  two  pieces  of  information  are  independent 
of  the  technological  implementation  of  the  system  and 
have  recently  been  determined.7  This  necessary  in¬ 
formation  is  used  in  the  analysis  presented  here.  It  is 
the  purpose  of  this  paper  to  analyze  the  performance  of 
an  optical  holographic  implementation  of  a  residue 
number  system  truth-table  look-up  processor.  In  this 
analysis,  the  optimal  detector  threshold  seUing(s)  and 
the  total  probability  of  error  are  determined  for  the 
addition  and  multiplication  of  4-,  8-,  12-,  and  16-bit 
words. 

a.  System  Operation 

The  truth-table  look-up  optical  processor  operates 
on  bine *y -encoded  numbers.  In  the  present  case  the 
numbers  are  binary-encoded  residue  numbers.  Each 
system  output  bit  is  implemented  as  a  separate  Boolean 
logical  function  of  the  input  bita.  This  is  dons  by 
having  input  patterns  from  the  truth  table  of  each 
function  that  cause  the  output  bit  to  be  a  logical  one 
stored  in  the  system.  Inputs  to  the  system  are  com- 
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Fig. ).  Holographic  truth  Ubl*  look-up  procoaor  (a)  recording 
ihc  truth-table  hologram*  <b)  practising  of  binary  input  data. 


pared  with  the  stored  patterns,  and  detected  matches 
cause  the  appropriate  output  bita  to  be  logical  ones. 
Output  bita  for  which  no  match  is  detected  are  logical 
zeros. 

By  locating  stored  binary  patterns  that  match  the 
input  pattern,  the  processing  system  functions  as  a 
content-addressable  memory*  Other  methods  of 
implementing  content-addressable  or  associative 
memories  optically  have  relied  on  producing  cross  cor¬ 
relations  by  multiplication  in  the  Fourier  plane.9-10 
Such  systems  produce  outputs  that  are  analog  in  nature 
and  arc  not  specific  as  to  which  portions  of  the  input 
field  match  the  stored  patterns.  The  type  of  optical 
content-addressable  memory  analyzed  here  does  not 
have  these  difficulties.3  Its  operation  as  a  truth-table 
look-up  processor  is  analyzed  in  this  paper. 

The  pai  ticular  content-addressable  memory  system 
treated  here  is  based  on  an  optical  NAND  operation. 
Reference  patterns  are  holographically  stored  and  fixed 
in  a  recording  medium  such  as  an  electrooptic  crystal. 
The  Boolean  logical  NANO  operation  is  implemented 
in  manner  similar  to  the  holographic  NAND  operation 
demonstratedbyPreston.11  In  the  NAND  system  de¬ 
scribed  here,  the  reference  beam  steps  sequentially 
through  a  series  of  spatial  positions  in  the  input  plane 
of  the  Fourier  transform  configuration  as  shown  in  Fig. 
1  (s).  Each  reference  beam  position  corresponds  to  a 
different  reference  pattern  to  be  recorded.  Each  ref¬ 
erence  pattern  is  recorded  by  using  a  phase-shifting  line 
composer  (a  particular  type  of  spatial  light  modulator) 
containing  one  more  element  than  the  number  of  bita 
in  each  pattern.  The  extra  element  is  designated  as  a 
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reference  bit  and  iiMtlti  phase  shift  of  0*  for  all  re¬ 
cordings.  All  the  bits  that  are  ones  in  the  reference 
pattern  are  recorded  with  a  phase  of  180*,  all  the  bits 
that  are  xeroa  are  recorded  with  a  phase  of  0*  (the  same 
phase  as  the  reference  bit),  and  don't-care  bits  are  not 
recorded  (spatial  light  modulator  does  not  pass  light  at 
that  bit  position).  The  light  amplitude*  associated  with 
all  the  one  and  zero  bits  are  eoual.  The  amplitude  as¬ 
sociated  with  recording  the  reference  bit  is  R  times  the 
amplitude  of  each  one  (or  zero)  bit,  where  R  is  the  total 
number  of  ones  in  the  particular  reference  pattern  being 
recorded.  A  complete  reference  pattern  is  recorded  as 
described  for  a  single  position  of  the  reference  beam. 
The  reference  beam  is  then  stepped  to  a  new  location, 
and  a  new  reference  pattern  is  entered  into  the  spatial 
light  modulator  and  a  holographic  recording  made. 

After  all  reference  patterns  are  recorded,  the  phase- 
shifting  line  composer  is  replaced  by  an  on-off  ampli¬ 
tude  line  composer  as  shown  in  Hig.  1(b).  The  system 
is  then  able  to  function  directly  as  a  truth-table  look-up 
processor.  A  binary  data  pattern  is  entered  into  the 
on-off  amplitude  line  composer.  For  one  bits  in  the 
input  data  pattern  the  spatial  light  modulator  is 
transparent,  and  for  zero  bits  it  is  opaque.  If  (and  only 
if)  the  input  data  pattern  matches  a  prerecorded  ref¬ 
erence  pattern,  then,  upon  reconstruction,  there  will  be 
wavefront  cancellation  in  the  direction  of  the  particular 
reference  beam  that  corresponds  to  that  reference 
pattern.  This  is  detected  by  an  array  of  photodetectors 
with  each  position  in  the  array  corresponding  to  a  dif¬ 
ferent  reference  beam  position  during  recording.  Thus, 
each  detector  corresponds  to  a  particular  binary  refer¬ 
ence  pattern,  and  when  wave-front  cancellation  (a  null) 
occurs  at  that  detector,  a  match  of  the  input  data  to  that 
reference  pattern  has  been  found.  Each  recording  may 
be  thought  of  as  producing  an  optical  NAND  gate.  The 
phase  shifts  of  the  phase-shift  line  composer  during 
recording  determine  which  data  bits  during  processing 
will  be  presented  to  the  NAND  gate  in  complemented 
form  (the  ones  in  the  input  data  pattern)  and  which  will 
be  presented  in  uncomplemented  form  (the  zeros  in  the 
input  data  pattern).  During  processing,  the  output  of 
the  NAND  operation  then  occurs  in  the  detector  array 
plane  at  the  reference  beam  position  for  that  re¬ 
cording. 

This  form  of  processor  may  be  extended  to  operate 
on  many  input  data  words  sim.ilt&neously  (in  parallel).3 
The  set  of  holograms  recorded  as  described  ab-ivt  may 
he  read  out  using  multiple  binary  amplitude  line  com¬ 
posers  displaced  above  and  below  the  recording  plane 
of  incidence  so  that  the  fundamental  Bragg  condition 
is  still  satisfied.13  In  other  words,  the  lack  of  angular 
selectivity  in  the  direction  perpendicular  to  the  plane 
of  incidence  for  recording  is  used  to  al'ow  the  recon¬ 
struction  of  the  holograms  by  many  input  data  patterns 
simultaneously. 

For  the  holographic  truth-cable  look-up  processor 
described,  the  detectors  need  only  to  detect  whether  the 
light  amplitude  at  that  particular  detector  is  above  or 
below  the  threshold  value  and  thus  a  zero  or  a  one.  re¬ 
spectively.  It  may  be  possible  to  set  the  threshold  value 


for  each  detector  (corresponding  to  a  particular  refer¬ 
ence  pattern)  separately,  or  there  may  be  only  one 
threshold  value  for  the  entire  array  of  photodetectors. 
In  either  case,  it  is  necessary  to  know  how  the  thresh- 
old(s)  should  be  set  in  practice  to  minimize  the  total 
probability  of  error.  This,  in  turn,  depends  on  the 
amplitude  and  phase  variations  about  the  required 
values  of  the  laser  beams  during  recording  of  the  ref¬ 
erence  patterns  (fabrication  of  the  processor).  Note, 
however,  thst  whrn  operated  as  a  processor,  the  system 
is  insensitive  to  amplitude  and  phase  variations  in  the 
beam  that  passes  through  the  spatial  light  modulator 
as  i*  is  the  only  wavefront  in  the  «yslem,  and  there  is  no 
need  for  precise  control  on  this  single  beam  that  is  re¬ 
constructing  the  set  of  recorded  holograms.  During 
recording,  the  amplitudes  and  phases  of  the  beams  are 
obviously  of  critical  importance.  In  the  following  sec¬ 
tions  the  optimal-threshold  setting(s)  of  the  detectors 
and  the  minimum  total  probability  of  error  in  per¬ 
forming  addition  and  muitipticalion  will  be  determined 
as  a  function  of  amplitude  and  phase  variations  in  the 
beams  during  recording  of  the  reference  patterns. 

III.  Mathematical  Modal 

A.  Distributions  of  the  Recording  Waves 

In  the  following  analysis,  each  recording  wave  is 
considered  to  be  a  plane  wave  and.  therefore,  is  repre¬ 
sented  by  an  amplitude  in  the  complex  plane  (a  phasor). 
The  amplitude  and  the  phase  of  each  recording  wave 
(relative  to  the  reference  wave)  are  considered  as  two 
independent  random  variables,  and  it  ls  assumed  that 
both  have  Gaussian  distributions  centered  on  the  design 
values.  The  Gaussian  distribution  is  commonly  as¬ 
sumed  when  the  deviation  is  caused  by  multiple  phys¬ 
ical  phenomena.  This  assumption  is  based  on  the 
fundamental  (central)  limit  theorem  in  probability 
theory  .13  The  probability  density  function  for  the  ith 
wave  contributing  to  the  reconstruction  can  be  written 


P,l*4)  * 


where  a  and  0  are  the  amplitude  and  the  angle  param¬ 
eters  in  polar  coordinates,  a,  and  0,  are  the  design  values 
of  the  amplitude  and  the  phase  which  were  expected  to 
be  recorded,  and  <r0,  and  <r»,  are  the  standard  deviations 
in  the  amplitude  and  the  phase,  respectively,  from  the 
design  values.  The  angle  6  is  measured  in  the  range 
from  (0,  —  ir)  to  {0,  +  i)  to  prevent  inconsistency  at  the 
two  limiting  points.  Since  the  amplitude  should  be 
positive  (o  >  0)  and  the  phase  is  restricted  to  a  2t  range, 
use  of  a  Gaussian  distribution  may  seem  at  first  to  be 
incorrect  because  the  parameters  do  not  have  a  variation 
range  of  — •  to  +■>.  However,  for  any  practical  pro¬ 
cessor  «  o,  and  «  2r,  and  these  restrictions  on 
the  amplitude  and  phase  have  no  significant  effect. 

As  described  in  Sec.  11,  three  type*  of  waves  are  re¬ 
corded.  The  first  two  types  correspond  to  1  and  0  bits 
in  the  recording  pattern.  They  both  have  unit  ampli- 
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tude J3|t>  ■  5(oi  -  1|,  but  their  phases  are different  (5(n 
■  *,tf,  m  0|.  The  third  type  correspond)'  to  the  refer¬ 
ence  bit  Its  phase  is  the  reference  zero  phase  in  the 
system  j8<*>  ■  0],  and  its  amplitude  is  R,  where  R  is  the 
number  of  1  bits  in  the  recording  pattern  [a,*)  »  /?). 
Considering  the  physical  construction  of  the  processor, 
it  is  assumed  that  t  he  standard  deviations  of  the  phase 
for  the  three  types  are  equal,  i.e., 

,2) 

Since  the  phasors  corresponding  to  C  and  1  bits  are 
similar  in  amplitude,  their  amplitude  standard  devia¬ 
tions  are  assumed  equal: 

ir,.,.  «  i  a.  f3) 


In  the  general  case,  the  convolution  process  must  be 
done  numerically.  Two-dimensional  convolutions  are 
computationally  intensive,  ao  in  our  analysis,  we  re¬ 
duced  the  number  of  operations  by  defining  a  window 
matrix  for  each  phasor  and  convolving  only  the  elements 
of  these  matrices.  F-ven  this  method  requires  Urge 
matrices  for  practical  case*,  where  the  standard  devia- 
tjona  are  small  (a  few  percent  I.  Through  the  convolu¬ 
tion  process,  the  region  around  the  expected  value  of 
each  complex  amplitude  |i.e.,  around  the  <x,,0>  point) 
is  of  primary  itr  portance  in  determining  the  final  dis¬ 
tributions.  In  this  region,  which  is  characterized  by  y 
cv  Qand  |x|  »  |x,|  »  |y|,  the  actual  distribution  given 
by  Eq.  (6)  can  be  approximated  by 


The  amplitude  standard  deviation  of  the  reference  bit 
is  a  function  of  the  value  of  R  ana  is  assumed  to  be 

This  assumption  is  based  on  the  fact  that  the  reference 
bit  can  be  considered  as  the  sum  of  R  independent 
amplitudes,  each  of  them  having  a  standard  deviation 
of  ot.  From  probability  theory,  tne  variance  of  the  sum 
of  several  independent  variables  is  the  sum  of  the 
variances  of  each  variable,1*  therefore,  a -  <r;  +  ai 
+  .,.■*  Ra-„.  Although  the  conditions  (2)-(4)  have 
been  used  in  the  present  work,  the  method  of  analysis 
applies  equally  to  other  conditions  that  may  occur. 

Having  the  probability  density  function  of  each 
constituent  wave  |Eq.  (1)),  the  next  step  would  be  to 
find  the  density  function  cf  the  resultant  phasor.  The 
resultant  complex  amplitude  is  obu  ined  by  the  vecto¬ 
rial  sum  of  the  phasors  that  are  present  during  the 
reading  process.  In  the  1-D  case,  if  a  random  variable 
is  the  sum  of  two  independent  random  variables,  its 
density  function  can  be  obtained  by  convolving  Ihe 
density  functions  corresponding  to  the  two  random 
variables.14  That  is,  if  x  ■=  xi  +  x*.  tkenp(x)  =  Pi(x) 
•  Px(x),  where  p\,  p2,  and  p  are  the  density  functions 
corresponding  to  X|,  Xj,  and  x,  and  •  represents  con¬ 
volution.  This  property  can  be  directly  extended  to  the 
2-D  case.  That  is,  if  two  phasors  =  X|  +  jy\  and  a- 
“  zj  -*  jy2  are  independent,  the  density  function  of  their 
sum  phasor  (a  =  a:  +  a2)  can  be  obtained  by  the  2-1) 
convolution  of  the  two  density  functions, 

Pi»X|sV|»Pj(I  -  xio  -  V(Wl|</.V|.  (5) 

To  use  the  above  property,  a  change  of  the  random 
variables  from  amplitude  and  phase  (o  and  6)  to  real 
and  imaginary  components  (x  and  y)  is  required.  The 
equivalent  form  of  Kq.  (1)  for  the  new  random  variables 
would  be: 

P,<‘&)  •  ■  ■  ■ 1 - - 

2»!xl  +  y’>‘'V,,a,, 

X*ipH<«»+  y,ll'J-S.|’/2e’.t 

Xtipt-|un-'(y/x)-J,|-72«U  (61 

The  density  function  of  the  resultant  complex  ampli¬ 
tude  can  then  be  obtained  from  the  density  functions 
of  the  constituent  complex  amplitudes  by  repeating  the 
convolution  as  many  times  as  required. 


p.lxyl »  - — - — *ip(-<x  -  r.i,/2«U‘sp<->V2«i).  (?) 

where  a,,  ■  a*,  and  a,,  *•  |x,  |  o«,  are  the  approximated 
values  of  the  standard  deviations  of  the  new  random 
variables  (x  and  y ).  The  convolution  of  two  Gaussian 
distributions  as  given  by  Eq.  (7)  can  be  done  analytically 
and  results  in  another  Gaussian  distribution.  That  is. 
if 

P:\xy)  -  - - txp|-(x  -  X,)>/2«*,|  npt-yS/2**,).  (61 

2**„«„ 

”  - — - Mp|-(1  -  expt-yV2<4,).  (9) 

then 

p(*y)  ■  Pi(xy)  *  pAxjt 

X  expf-lx  -  (?,  +  +  <r?,l| 

X  expl-y’rttej,  ♦  «;,)).  (10) 

This  relationship  can  be  extended  to  any  number  of 
Gaussian  functions. 

To  test  the  accuracy  of  the  above  approximation,  the 
threshold  analysis  was  performed  on  both  density 
functions  |Eqs.  (6)  and  (7))  for  different  patterns  with 
a  range  of  standard  deviations  (om  »  <r*  «  o)  from  0.04 
to  9.11  with  0.01  intervals.  The  results  showed  that  the 
threshold  values  corresponding  to  the  two  types  of 
distributions  become  more  nearly  the  same  as  a  de¬ 
creases.  The  percenlcge  deviation  between  them  is 
<1%  for  a  <0.07. 

B.  Optimum-Threshold  Condition 
To  deal  with  the  problem  of  threshold  setting,  some 
terms  which  are  useful  in  error  analysis  are  first  defined. 
These  terras  are  introduced  for  1-D  distributions  as 
usually  discussed16  and  then  are  extended  to  the  2-D 
case  of  interest. 

Figure  2  shows  two  density  functions  centered  at  x 
■  0  and  x  ■  1 ,  which  correspond  to  the  logical  states  0 
and  1,  respectively.  If  the  threshold  is  set  at  x  *  x,s. 
the  detected  signal  is  incorrect  for  a  transmitted  1  if  x 
<  its  and  for  a  transmitted  0  if  x  >  x,s.  If  the  case  of 
interest  is  detecting  a  0,  the  first  type  of  error  results  in 
a  false  alarm,  while  the  second  type  results  in  a  miss. 
The  probabilities  of  these  two  events  are 


5'-% 


3SM  «HtO  OPTICS  /  Wat.  22,  No  22  /  15  November  1931 


Fig.  2.  Probability  detuitin  is)  the  1-D  raat.  A  gentftl  threiKoki 
setting  lxl)t)  and  the  op limum- threshold  setting  Us)  ait  shown. 
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Fig*  (a)  Example  phasor  diagram  showing  the  complex  amplitude 
of  the  n\t  front  for  each  bit  (b)Phasor  diagram  showing  all  possible 
resultant  amplitude*  at  a  particular  detector.  Numbers  indicate 
degeneracy  of  ^hasor. 


0,-Pl  J^]*Pl(«W*.  Pm  ■  P.  J^'pdxWl.  (11) 

where  P/„  and  Pm  are  the  probabilities  of  false  alarm 
and  miss,  respectively,  and  Pi  and  P0  are  the  corre¬ 
sponding  probabilities  of  receiving  a  1  and  a  0  (Pi  +  P0 
■  I).  The  probability  of  error  in  detecting  a  signal 
correctly  is  the  sum  of  the  above  two  types  of  errors: 

P,  -  Pi  £%i( iWx  +  Pof  ''p.t,^  (12) 

The  optimum-threshold  setting  ix0)  which  corresponds 
to  the  minimum  error  can  be  obuined  by  solving  dP,/ 
dxtk  -  0  for  x,i,  subject  to  d’P./dx?*  >  0.  The  result 
is  Po(xo>/pi(xo)  •  P|/P»  For  the  special  case  when  the 
probabilities  of  receiving  0  and  1  are  equal,  i.e..Po  *  Pi, 
the  optimum  threshold  is  at  the  intersection  of  the  two 
density  functions. 

The  present  case  involves  2-D  rather  than  1-D  dis¬ 
tributions.  To  illustrate  the  analysis,  the  simple  case 
where  the  modulus  is  4  and  the  recorded  pattern  is  1010 
is  considered.  The  corresponding  phasor  diagram  is 
shown  in  Fig.  3.  In  the  reading  process,  if  the  input 
pattern  matches  the  recorded  hologram,  the  resultant 
complex  amplitude  has  a  distribution  around  the  origin. 
Any  other  input  pattern  results  in  a  mismatch  case, 
where  the  resultant  complex  amplitude  has  a  distribu¬ 
tion  around  a  positive  integer  number  on  the  real  axis. 


To  illustrate  the  problem  of  threshold  setting,  the  dis¬ 
tributions  which  correspond  to  Fig.  3(b)  are  shown  in 
Fig.  4.  Since  the  detectors  respond  to  the  light  inten¬ 
sity,  the  threshold  corresponds  to  a  magnitude  oi  the 
amplitude  (o,»).  The  probabilities  of  miss  and  false 
alarm  are 

Pm  " P*  f  f*  poloSWMo,  (1.1) 

F*- jf/*,  P*  f,r p,taJXIMo.  (14) 

where  Po  and  P,  are  the  probabilities  of  the  match  and 
the  ith  mismatch,  and  po(o,6)  and  p, (a, 6)  are  the  den¬ 
sity  functions  corresponding  to  the  match  and  the  tth 
mismatch  cases.  The  parameter  Nm  is  the  total  num¬ 
ber  of  mismatch  cases.  As  in  the  previous  situation,  I  he 
probability  of  error  is  obtained  by  adding  the  above  two 
probabilities: 

P,  -  Pt  C  f'^ctWNa 

J«"*M  s/M 

Nm  am  /»2r 

+  £/>.  I  I  pAaMMu  (15) 

|«|  Jl«0  J*s* 

The  optimum  threshold  (ao)  can  be  obtained  by  solving 
dP,/da,h  ~  C  for  a,h  subject  to  <fJP,/da?»  >  0.  The 
result  is 

J*»  if  P,  f,'p,(a»,*)d»  (16) 

•/*•«  «•)  */»•» 

Assuming  equal  probabilities  for  any  poMible  pattern 
in  the  input,  Po  •  P,  *  UN,n,  where  N,„(“  Nm  +  1)  is 
the  number  of  passible  input  patterns.  Using  this,  Eq. 
(16)  can  be  written  as 

f"  PotaoJtdl  •  E  CUpMo*M  (17) 

|B|  s/hI 


Fig.  4.  Probability  density  function  of  the  resultant  complex  am¬ 
plitude  corresponding  tit  the  binary  pattern  1010.  The  optimum- 
threshold  amplitude  setting  is  shown.  This  figure  is  plotted  ft*  the 

t,  *  «e«0J0  case. 
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The  above  aquation  shows  that  the  optimum  thmhold 
(at)  haa  tbs’ following  property.  If  a  circle  centered  at 
tba  origin  with  radius  oo  ia  chosen  as  the  contour,  the 
contour  integral  of  the  match  ease  density  function  is 
equal  to  the  sum  of  the  contour  integrals  of  the  density 
functions  corresponding  to  all  the  mismatch  cases. 
Having  the  density  functions  of  the  match  and  the 
mismatch  cases,  the  optimum  threshold  can  be  obtained 
by  solving  Eq.  (17)  using  a  successive  numerical  ap¬ 
proximation  technique.  ThesummationinEq.  (17)is 
over  all  the  mismatch  cases.  However,  since  a,  ,c» « 
1,  only  those  cases  which  have  distributions  closest  to 
the  match  distribution  have  a  significant  effect  The 
term  “nearest  mismatch"  is  used  to  represent  these 
cases.  Unlike  the  1-D  esse,  the  optimum  threshold 
which  is  obtained  from  Eq.  (17)  in  general  is  not  at  the 
intersection  of  the  match  and  the  sum  of  the  nearest 
mismatch  distributions.  This  is  illustrated  in  Fig.  5. 

C.  Mt/hods  of  Threshold  Sotting 

Two  Methods  of  threshold  setting  have  been  inves¬ 
tigated.  These  are  called  the  separate-setting  and  the 
cotnmor.  setting  techniques.  In  the  separate-setting 
techniq  j  ;,  each  detector  is  aet  to  its  own  optimum 
threshold  value,  which  is  obtainable  from  Eq.  (17).  In 
the  common-setting  technique  a  single  setting  is  used 
for  the  entire  array  of  detectors  This  value  is  chosen 
so  that  the  total  error  in  reading  the  output  word  is 
minimized.  Each  technique  has  its  own  advantage. 
The  separate  setting  would  result  in  smaller  error  val¬ 
ues,  while  the  common  aetting  would  be  easier  to  in¬ 
strument. 

D.  Total  Probability  of  Error 

The  probability  of  error  for  one  detector  was  dis¬ 
cussed  in  Sec.  I1I.B.  Depending  on  the  modulus  used, 
each  output  word  has  a  number  of  bits,  and  each  bit  is 
determined  by  the  combined  action  of  a  group  of  de¬ 
tectors.  To  read  the  output  word  correctly,  all  the  bits 
must  be  correct.  This  requires  that  the  corresponding 
detectors  individually  detect  correctly.  It  is  assumed 
that  (1)  the  error  for  each  detector  is  independent  of  the 
errors  for  the  other  detectors  of  the  same  bit,  and  (2)  the 
error  for  a  bit  is  independent  of  the  errors  for  the  other 
bits.  Based  on  these  assumptions,  the  probability  of 
correct  detection  of  the  output  word  (P,)  is 

f.-  il  ft  US) 

cl  *-i 

where  J  ia  the  number  of  bits  in  the  output  word,  and 
Kj  is  the  number  of  detectors  corresponding  to  the  ;th 
bit  The  parameter  P,o*i  is  the  probability  of  error  for 
the  kth  detector  corresponding  to  the  >th  bit  This 
parameter  can  be  obtained  by  the  method  discussed  in 
Sec.  I1I.B.  Using  Eq.  (18),  the  probability  of  error  in 
the  output  word  would  be 

ft-i-P.-t-  il  ft  lt-P.g.,1.  US) 

/•  i»-i 

For  the  processor  to  have  a  larger  dynamic  range,  a 
aet  of  moduli  is  selected.  It  is  assumed  that  the  errors 
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amplitude  wttinp  that  product  twin  tha  optimum  error  (oj,  and 
•vl  an  indicated. 


associated  with  the  outcome  for  each  modulus  are  in¬ 
dependent  of  the  other  outcomes.  Following  the  same 
argument,  the  total  probability  of  error  for  the  processor 
(P,t)  would  be 

p..  - 1  -  n  il  rf ;» -  iwii.  «» 

1-1 ,-l »-l 

where  the  index  i  is  added  to  indicate  different  moduli, 
end  /  is  the  number  of  moduli  used  in  the  processor. 
For  the  practical  case,  where  P„  «  1,  the  second-  and 
higher-order  terms  can  be  neglected,  end  Eq.  (20)  is 
reduced  to 

Pw  »  i  t  t  Phm*i-  Bl> 

>-i  ,-it-i 

This  shows  that  the  total  probability  of  error  for  the 
processor  can  be  obtained  by  adding  the  error  proba- 
bilities  of  all  the  detectors. 

E.  Error  Analysis  for  Reduced  Truttt  Tables 
The  logical  reduction  of  truth  tablet  is  e  central  issue 
in  this  type  of  processor  since  it  determines  the  mini¬ 
mum  number  of  required  holograms.1  In  the  reduced 
truth  table,  don’t-care  bit  positions  appear  in  some 
input  words.  In  recording  a  pattern  with  some  don’t- 
care  bits,  the  positions  of  the  don't-care  bits  are  opaque 
at  the  page  composer.  In  the  reading  process,  the  bits 
at  these  positions,  therefore,  do  not  have  any  effect  on 
the  resultant  wsve. 

The  error  analysis  for  the  redo  ed  case  is  more  com¬ 
plicated  than  for  the  uni  educed  case.  Since  each 
don’t-care  bit  can  be  replaced  byeitheraOora  1, if  the 
number  of  don't-care  bits  in  a  hologram  is  N< ,  there 
would  be  1N*  patterns  which  match  that  hologram. 
The  number  of  nearest  mismatch  patterns  is  also  al¬ 
tered.  Disregarding  the  positioia  of  don't-care  bits,  the 
nearest  mismatch  patterns  can  be  divided  into  two 
groups;  (1>  the  patterns  which  are  similar  to  the  match 
case  with  the  exception  of  having  one  0  in  the  place  of 
one  1;  and  (2)  the  patterns  which  are  similar  to  the 
match  case  with  the  exception  of  having  one  1  in  the 


$-2% 


Fig.  6.  Characteristic  probability  of  error  curves  corresponding  to 
Che  logically  reduced  truth  table  for  the  addition  operation  with 
modului  4.  Tire  parameters  of  curves  MaieiUA  -  l.Naa,  m6.N, 
-  I: (2111  *  1.  Wm,  -  IN,  -  2;  (3)  A  -  2.A1.»  -  4.  N,  -  O-.and  (4) 
A  “  4. AC,  -  4,A/e  ■  0.  The  corresponding  degeneracies  of  curves 
1-4  are  4. 2, 1. 1.  respectively.  The  dotted  curve  corresponds  to  the 
probability  of  error  for  the  common-threshold  setting,  technique. 

This  figure  is  plotted  for  the  »«■#!■  0.05  case. 

place  of  ore  0.  In  the  first  group,  each  don't-care  can 
be  replaced  by  either  0  or  1.  The  number  of  this  type, 
*  therefore,  increases  by  a  factor  of  2N<  In  general,  this 
is  not  the  case  for  the  second  group  when  residue 
arithmetic  is  used  since  some  of  the  patterns  correspond 
to  numbers  greater  than  or  equal  to  the  modulus  and, 
therefore,  are  not  allowed.  Since,  in  general,  the 
number  of  nearest  mismatches  is  not  predictable,  an 
exhaustive  computer  search  is  used  to  count  all  the 
possible  cases  for  each  pattern  for  a  specific  operation 
with  a  specific  modulus.  Considering  the  above  two 
effects  (increase  in  the  number  of  match  and  nearest 
mismatch  cases),  the  method  of  error  analysis  would  be 
similar  to  the  unreduced  case. 

IV.  Results 

A.  Probability  of  Error  for  a  Single  Modulus 

To  analyze  the  effect  of  amplitude  and  phase  errors 
on  performing  a  particular  operation  with  a  particular 
modulus,  a  list  must  first  be  compiled  of  all  the  reference 
patterns  that  must  be  recorded  as  holograms.  Each 
hologram  is  characterized  by  three  parameters.  (1)  the 
amplitude  of  the  reference  bit  (A);  (2)  the  number  of 
nearest  mismatch  patterns  (Nn„);  and  (3)  the  number 
of  don't-care  bite  (A/g).  If  these  parameters  are  the 
same  for  two  different  reference  patterns,  the  proba¬ 
bility  of  error  for  a  given  threshold  setting  will  be  the 
same  at  their  corresponding  detectors.  For  each  case, 
a  curve  of  probability  of  error  vs  threshold  setting  is 
obtained  by  doing  the  required  convolutions  and  vol¬ 
ume  integrations  as  described  in  Sec.  III. 

For  illustration,  the  four  characteristic  error  curves 
that  correspond  to  the  logically  reduced  truth  table  for 
the  addition  operation  with  modulus  4  are  shown  in  Fig. 
6.  As  can  be  seen,  the  probability  of  error  varies  dra- 
meticaly  with  the  threshold  setting.  This  dependence 
becomes  more  pronounced  for  smaller  values  of  <ra  and 
»«.  Considering  the  degeneracy  of  each  curve,  the 


probability  of  error  for  the  separate-threshold  setting 
technique  can  be  obtained  from  Eq,  (19)  using  the 
minimum  value*  from  tbeee  curves.  To  find  the  opti¬ 
mum  threshold  for  the  common-threshold  setting 
technique,  the  composite  P,  vs  a,*  curve  is  obtained 
from  the  individual  curve*  using  Eq.  (19).  The  result 
is  shown  as  the  dotted  curve  in  Fig.  6.  The  minimum 
point  of  this  curve  is  the  optimum  choice.  As  can  be 
seen,  the  threshold  setting  is  primarily  determined  by 
the  curve  with  the  largest  reference  bit  size. 

The  probability  of  error  for  the  addition  and  multi¬ 
plication  processes  with  moduli  2-23  was  calculated  for 
two  values  of  standard  deviations  (a.  «  e,  -  0.01  and 
»«  *  »i  ■  0.02)  and  for  both  threshold -setting  tech¬ 
niques.  The  results  are  given  in  Tables  I  and  11  fi>r 
unreduced  truth  tabka  and  Table*  III  and  IV  for  re¬ 
duced  truth  tablet.  These  tables  also  provide  the 
number  of  required  holograms  and  the  optimum 
threshold  value  for  the  common-threshold  setting 
technique  in  each  case.  Since  the  list  of  the  optimum 
threshold  values  for  the  separate-threshold  setting 
technique  is  very  long,  it  is  not  reported  in  this  paper; 
however,  the  corresponding  probabilities  of  error  are 
provided  for  comparison.  Analysis  of  these  tables 
produces  the  followlig  conclusions: 

(I)  The  probability  of  error  for  both  threshold-set¬ 
ting  techniques  strongly  depends  on  the  largest  ampli¬ 
tude  of  the  reference  bit  (fin.,)  among  the  recorded 
holograms.  Probability  of  error  calculations  for  dif¬ 
ferent  moduli  that  have  the  same  largest  reference  bit 
amplitude  (regardless  of  the  operation  being  imple¬ 
mented)  will  result  in  nearly  equal  error  probabili¬ 
ties. 

(21  The  optimum  threshold  for  the  common- 
threshold  setting  technique  strongly  depends  on  the 
largest  reference  bit  amplitude  (Amu). 

(3)  Although  the  probability  of  error  in  all  case*  is 
smaller  for  die  separate-threshold  setting  than  with  the 
common-threshold  setting,  the  two  values  are  similar. 
As  a  result,  a  simple  detector  array  using  the  common- 
threshold  setting  technique  would  not  produce  a  sig¬ 
nificant  increase  in  the  probability  of  error. 

(4)  Comparison  ofTabies  I  and  II  with  Tables  III  and 
IV  results  in  *he  conclusion  that  applying  the  reduction 
techniques  has  the  benefit  -f  both  reducing  the  number 
of  required  holograms  by  a  large  factor  and  significantly 
decreasing  the  error  rate  of  the  processor. 

(5)  The  probability  of  error  occasionally  decrease* 

with  increasing  modulus  size.  This  occurs  either  be¬ 
cause  (1)  the  truth  table  for  the  larger  modulus  has  a 
smaller  (producing  a  large  decrease  in  error)  or  (2) 

Ami  is  unchanged  and  the  number  of  truth-table 
entries  associated  with  Rm„  in  the  larger  modulus  is 
smaller  (producing  a  small  decrease  in  error). 

B.  Total  Probability  of  Error  for  a  Sot  of  Moduli 

To  increase  the  numerical  range  of  the  processor,  a 
set  of  relatively  prime  moduli  may  be  used.  The  cov¬ 
ered  range  is  equal  to  the  product  of  the  individual 
moduli  The  selection  rules  to  choose  a  moduli  set  de¬ 
pends  on  the  parameter  that  is  to  be  optimized.  Since 
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the  error  probabilities  for  all  the  ease*  studied  were 
aufflciently  small,  it  is  appropriote  to  choose  the  moduli 
•et  that  corresponds  to  the  smallest  number  of  reference 
patterns  (rather  than  the  set  that  minimizes  the  prob¬ 
ability  of  error).  The  algorithm  developed  and  the 
optimum  moduli  seta  for  addition  and  multiplication 
of  two  4-,  8-,  J2-,  and  16-bit  words  were  determined  and 
•re  reported  in  another  paper.7  Using  these  optimum 
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moduli  seta,  the  total  probabilities  of  error  were  calcu¬ 
lated  for  the  above  operations  for  two  values  of  standard 
deviation*  in  amplitude  and  phase  during  recording. 
The  reaulte  are  presented  in  Tables  V  and  VI.  These 
tablet  show  that  the  total  error  probabilities  for  this 
processor  with  a  standard  deviation  of  0.01  in  both 
amplitude  and  phase  are  extremely  sirall  and  much 
better  than  required  for  practical  systems.  The  error 
values  for  ■  <r#  »  0.02  art  competitive  with  the  raw 
error  values  of  current-day  computer  technologies. 
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With  the  addition  of  appropriate  error  detection  and 
correction  bite,  these  raw  error  values  can  be  improved 
by  more  than  an  order  of  magnitude.  In  addition,  the 
number  of  required  holograms  is  within  the  state-of- 
the-art  for  volume  holography  in  electrooptic  crys¬ 
tals.'7 


V.  Summary 

The  operation  of  a  holographic  content-addressable 
memory  system  for  truth-table  look-up  digital  data 
processing  was  analyzed.  The  effects  of  Gaussian  dis¬ 
tributions  in  the  amplitude  and  phase  of  the  recording 
beams  were  treated.  The  operations  of  4-,  8-,  12-,  and 


16-bit  addition  and  multiplication  were  analyzed  for 
binary-coded  residue  numbers.  Setting  each  detector 
threshold  to  its  optimum  value  (separate-threshold 
setting  technique)  or  setting  all  detector  thresholds  to 
a  single  optimum  value  (common-threshold  setting 
technique)  is  found  to  produce  very  similar  probabilities 
of  error.  Optimum  detector  threshold  amplitude  set¬ 
tings  are  typically  ~65%  of  the  amplitude  of  a  compo¬ 
nent  wave  corresponding  to  a  tingle  bit.  For  practically 
achievable  standard  deviations  in  the  amplitude  and 
phase  about  the  design  values,  the  resultant  probabili¬ 
ties  of  error  are  generally  <10-4.  Using  logically  re¬ 
duced  versions  of  the  truth  tables  typically  results  in  a 
significant  decrease  in  both  the  probability  of  error  and 
the  number  of  required  holograms. 
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CHAPTER  Z 


INTRODUCTION 

statement  flf  Purpose 

Architectures  for  optical  holographic  digital 
parallel  processing  systems  are  presented  and  examined 
theoretically  and  experimentally.  The  theoretical 
examination  focuses  on  the  efficiency  with  which  information 
required  for  processing  can  be  stored  within  the  optical 
system.  The  experimental  studies  demonstrate  the  principles 
of  the  system  and  examine  factors  that  affect  its 
statistical  reliability. 

Motivation  lor  ths.  Thesis 

Today  there  is  a  large  and  growing  need  for 
computational  power  that  remains  unsatisfied  despite  the 
advances  of  data  processing  over  the  last  two  decades  [1]. 
Among  the  technical  areas  that  would  have  immediate 
application  for  increased  parallel  computational  ability  are 
remote  sensing,  seismic  data  interpretation,  nuclear  and 
molecular  physics  simulations,  meteorology,  air  traffic 
control,  synthetic  aperture  radar,  missile  guidance,  and 
defense  early  warning  systems  12,3].  Solution  of  problems 
in  these  areas  requires  identical  operations  to  be  performed 
on  large  arrays  of  numbers.  Therefore,  investigation  of  new 


methods  to  process  many  seta  of  data  in  parallel  represent 
inportant  and  timely  research. 

Related  greyloua  Hoik 

Optical  data  processors  are  attractive  for  parallel 
operation  because  they  provide  naturally  the  communication 
paths  that  limit  electronic  parallel  designs.  One  of  the 

e» 

earliest  investigators  to  realize  the  potential  of  optical 
digital  processing  was  Preston  (4].  Be  demonstrated  how 
elementary  logic  operations  could  be  performed  using 
holography  to  produce  lightwave  phase  interference  at  the 
detector  plane.  Gaylord,  Heaver,  and  Hagnusson  made  use  of 
a  Boolean  Exclusive  Or  function  in  their  proposed  parallel 
optical  word/signature  detector  [5] .  The  same  principle  is 
used  in  an  integrated  optical  device  built  at  Battelle 
Laboratories  designed  to  analyze  spectral  data  [6] .  Also, 
Buignard  et  al.(7]  have  shown  that  the  result  of  an 
Exclusive  Or  operation  performed  in  parallel  on  two  entire 
pages  of  data  can  be  recorded  directly  as  a  thick  hologram 
in  an  electrooptic  crystal.  The  .Tse  computer  was  an  early 
attempt  to  perform  parallel  digital  computation  using  optics 
[8].  By  connecting  arrays  of  electronic  logic  elements  with 
bundles  of  optical  fibers  it  achieved  parallel  operation, 
but  failed  to  take  advantage  of  the  natural  parallelism  of 
optical  imaging  systems,  Goodman  has  developed  an  optical 
processor  that  uses  the  natural  parallelism  of  optics  to 
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perform  matrix  multiplication  of  numbers  recorded  as  analog 
photographic  transmittances  [9,10].  A  related  class  of 
analog  transmittance  processors  using  acoustooptic  input 
devices  and  iterative  numerical  techniques  is  under 
development  by  Casasent  (11,12).  Another  form  of  analog 
optical  feedback  processor  in  the  work  of  Cedarquist  and  Lee 
has  been  used  to  solve  partial  differential  equations  (13). 
Knight  has  suggested  an  optical  digital  processor  based  on 
an  associative  memory  (14).  The  structure  chosen  for  the 
memory  permits  only  bit-serial  operations  to  be  performed. 
An  optical  processor  for  conversion  between  binary  and 
residue  numbers  and  for  addition  of  residue  numbers  also  has 
been  proposed  (15).  Another  type  of  processor,  being  built 
by  Collins  et  al.,  uses  the  cyclic  nature  of  optical 
lightwave  phase  to  perform  parallel  additions  in  a  residue 
number  system  [16,17].  There  is  considerable  interest  in 
the  Soviet  Union  in  optical  digital  processing.  The 
emphasis  there  is  largely  on  building  fast  adders  and 
multipliers  from  the  elementary  logic  functions  that  can  be 
produced  using  controlled  transparencies  in  various 
configurations  [18,19] .  Although  each  of  the  processors 
mentioned  represents  a  substantial  contribution  to  the  field 
of  optical  processing,  none  of  them  combine  the  power  of 
parallel  processing  with  the  accuracy  and  flexibility  of 
digital  operation.  Huang  has  presented  a  concept  for  a  very 


general  highly  parallel  digital  optical  processor  {20} *  but 
the  technology  to  inplement  the  processor  and  the  algorithms 
to  apply  it  to  numerical  problems  are  currently  uncertain. 
Reviews  of  the  area  of  optical  digital  processing  can  be 
found  in  References  21  and  22. 

Basic  concepts  and  Definitions 

Two  forms  of  the  holographic  digital  optical 

processor  are  treated  in  this  thesis.  The  forms  will  be 

referred  to  by  the  primary  optical  logic  funtion  each 
performs;  one  will  be  known  as  the  Exclusive  Or  processor , 
the  other  as  the  Rand  processor.  Included  in  both  forms  is 
an  input  device,  lenses,  a  recording  medium,  and  an  output 
device.  The  input  device  will  be  called  a  line  or  page 
composer,  and  consists  of  a  one-  or  two-dimensional  array  of 
elements.  Each  element  may  be  controlled  to  alter  the 
phase,  or  amplitude,  or  both,  of  light  passing  through  it. 
Both  forms  of  the  processor  use  two  Fourier  transform 
lenses.  This  appellation  results  not  so  much  from  the 
construction  of  the  lens,  which  can  be  simple  convex  in 
form,  as  it  does  from  the  position  and  use  of  the  lens  in 

the  system.  The  hologram  recording  medium  is  an 

electrooptic  crystal;  specifically,  lithium  niobate  was  used 
for  the  thesis  experiments.  The  output  device  is  a  one-  or 
two-dimensional  array  of  detector  elements.  Two  coherent 
beams  of  light  are  used  by  the  processor.  The  beam  that 
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passes  through  the  input  cask  is  called  the  object  bean ,  the 
other  is  the  reference  bean. 

The  function  performed  by  the  processor  is  defined 
during  the  fabrication*  or  hologram  recording*  stage.  The 
function  is  performed  on  sets  of  input  data  during  the 
operation  stage.  Information  recorded  during  the 
fabrication  stage  is  entered  as  reference  patterns 
represented  in  the  input  mask.  During  the  operation  stage* 
the  input  data  to  be  processed  define  the  input  mask 
patterns.  These  stages  are  analogous  to  fabricating  an 
integrated  circuit,  and  then  using  the  integrated  circuit  to 
process  data. 

Information  holographically  recorded  during 
fabrication  comes  from  the  truth-table  of  the  digital 
operation  that  the  processor  is  to  implement.  Each  bit  of 
the  processor's  output  word  has  an  associated  truth-table 
that  determines  the  bit's  logic  state  based  on  the  input 
data.  The  recorded  information  represents  reference 
patterns  that  are  compared  with  data  to  be  processed.  The 
presence  of  a  matching  pattern  among  the  reference  patterns 
stored  for  a  particular  output  bit  produces  an  indication  if 
the  output  should  be  true:  a  binary  one.  The  absence  of  a 
match  implies  the  output  is  false:  a  binary  zero.  The 
optical  processing  system  functions  as  a  content-addressable 
memory  by  searching  all  stored  reference  patterns  in 
parallel  for  those  that  match  the  input  data.  Operation  of 


the  processor  should  not  be  confused;  however,  with  optical 
associative  processors  that  operate  in  an  analog  manner. 

Considerable  savings  of  the  amount  of  truth-table 
information  that  must  be  stored  can  be  obtained  through 
logical  reduction  of  the  truth-tables  and  through  use  of  a 
residue  number  system  representation  of  the  input  and  output 
data.  The  process  of  truth-table  reduction  introduces 
"don't  care"  bit  positions  into  some  of  the  reference 
patterns.  These  positions  are  ignored  during  the  pattern 
matching  operation. 

OvfijLview  o£  Thasis 

The  following  sections  present  a  detailed  review  of 
the  work  that  has  been  performed  for  this  thesis.  In 
Chapter  II,  the  concepts  of  operation  for  both  forms  of  the 
processor  are  presented  and  compared.  Chapter  III  addresses 
the  problem  of  efficient  storage  of  truth-table  information 
in  hologram  recording.  The  results  of  a  computer  study  on 
the  effect  of  truth-table  reduction  and  the  use  of  different 
number  systems  are  presented.  In  Chapter  IV,  the  equipment 

i 

used  in  the  experiments  and  its  arrangement  are  explained. 

The  experimental  procedures  used  and  the  results  obtained 
are  presented  in  Chapter  V.  In  Chapter  VI,  results  of  the 
computer  and  experimental  studies  are  summarized  and  the 
prospects  for  optical  digital  parallel  processing  are 
assessed. 

s^A 


if 


7 


CHAPTER  XX 

THE  OPTICAL  PROCESSING  SYSTEMS 

Exclusive  fir  Processing 

The  operating  principles  of  the  Exclusive  Or 
processor  are  presented  in  this  section.  A  schematic 
diagram  of  the  arrangement  to  record  the  holograms  is  shown 
in  Figure  la.  Transparent  and  opaque  apertures  in  the  input 
mask  represent  binary  ones  and  zeros  respectively.  Each  row 
in  the  mask  represents  a  pattern  of  input  bits  that  produces 
a  logical  one  in  the  truth-table  for  the  output  bxt. 
Figure  2  shows  the  truth-table  used  to  create  the  mask  in 
Figure  la.  An  object  beam  of  coherent  light  passes  through 
the  mask.  The  two-dimensional  Fourier  Transform  of  its 
transmittance  pattern  is  produced  inside  the  electrooptic 
crystal  by  the  lens.  The  reference  beam  forms  an 
interference  pattern  with  the  transform  and  the  resulting 
fringes  are  recorded  in  the  crystal  by  the  photorefractive 
effect  [23].  When  the  illumination  is  removed,  a  space 
charge  pattern  remains  in  the  crystal.  The  electric  field 
produced  by  the  pattern  modulates  the  refractive  index  of 
the  crystal  through  the  electrooptic  effect  to  produce  a 
phase  hologram  of  the  mask.  To  process  data,  the  input  bit 
pattern  is  placed  in  every  row  of  the  mask,  as  shown  in 
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F'OURE  1.  THE  “EXCLUSIVE  OR"  BASED  FROCESSOR  la)  RECORDING  HOLOGRAM  OF 
TRUTH-TABLE  PATTERNS.  (01  EXAMPLE  OF  DATA  PROCESSING  WITH  INPUT 
DATA  THAT  DOES  NOT  MATCH  ANY  RECOROED  PATTERN,  (el  EXAMPLE  OF 

Data  processing  with  input  data  that  matches  one  recoroed 
PATTERN. 
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Figure  lb.  The  object  beam  passes  through  the  new  mask,  the 
first  lens,  the  crystal,  a  second  lens,  and  falls  on  the 
detector  array.  A  direct  image  of  the  input  mask  is 
produced  at  the  detector  plane  with  the  undiffracted  object 
beam.  Simultaneously,  the  reference  beam  is  holographically 
diffracted  producing  a  reconstruction  of  the  original  mask 
at  the  detector  plane.  The  net  complex  amplitude  result 
depends  on  the  relative  phase  and  amplitude  of  the  object 
and  reference  beams.  For  all  cases  of  interest,  the 
relative  amplitude  of  the  two  beams  is  adjusted  so  that,  at 
the  detector  plane,  light  from  each  of  them  is  of  equal 
amplitude.  At  the  detector  plane  there  will  be,  in  general, 
array  element  locations  where  either  the  object  or  reference 
beam  contribute  light,  or  both  contribute,  or  neither.  If 
the  relative  phase  of  the  beams  is  adjusted  so  that  light 
from  the  mask  image  and  from  the  hologram  reconstruction  is 
in  phase  at  the  detector  array,  the  result  represents  the 
Inclusive  Or  of  the  input  data  page  and  the  stored  data 
page.  This  assumes  that  singly  and  doubly  illuminated 
detector  elements  are  counted  as  logical  ones,  and 
unilluminated  elements  are  logical  zeros.  If  the  phase  of 
one  of  the  beams  i6  adjusted  so  that  the  two  waves  are  180 
degrees  out  of  phase  at  the  detector,  then  destructive 
Interference  will  occur  at  detector  elements  receiving  light 
from  both  sources.  Elements  will  be  dark  if  both  or  neither 
of  the  data  masks  were-  transparent  at  that  location;  they 
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will  bo  bright  if  just  on*  mask  was  transparent.  This 
result  at  the  detector  plane  is  the  Exclusive  Or  of  the  two 
data  pages ,  and  this  nay  be  used  for  nuaerical  processing. 
The  Exclusive  Or  of  any  binary  word  with  itself  results  in  a 
word  of  all  zeros.  Recall  that  each  row  of  the  stored  data 
page  is  a  pattern  of  input  bits  that  causes  the  output  bit 
to  be  a  one,  and  the  current  input  data  bit  pattern  is 
repeated  as  each  row  of  the  input  mask.  Therefore  a  match 
between  the  input  data  and  one  of  the  stored  patterns  is 
indicated  at  the  detector  plane  by  an  entire  row  of  dark 
elements.  The  detector  elements  may  be  connected  so  that 
the  presence  of  any  entirely  dark  row  causes  an  electrical 
indication  that  the  output  bit  is  a  logical  one.  The 
logical  expression  performed  in  parallel  by  the  Exclusive  Or 
processor  for  each  output  bit  is  given  in  Equation  1. 


0  -  Uxx.PuW. ’•v<In#Pra) 1  <» 

where: 

o  is  the  output  hit 
X.  is  the  kth  bit  of  the  input  data 
F?4  la  the  kth  bit  in  the  Jth  row  of  the  recorded 
**  hologram 

b  is  the  number  of  bits  in  the  input  data 
m  is  the  number  of  rows  in  the  recorded  hologram 
pattern 

a  is  the  optically  performed  Excluaive-Or  operation 
v  is  the  Or  operation  performed  by  the  detector 
a  is  the  And  operation  performed  by  the  detector 

Figure  lb  shows  an  example  of  data  processing  where  the 
input  word  does  not  match  any  stored  pattern}  Figure  lc 
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shows  the  case  of  a  natch. 

As  vill  be  explained  fully  in  Chapter  III,  a 
desirable  reduction  of  information  holographically  stored 
can  be  achieved  with  cruth-table  reduction.  This  process 
introduces  ‘don't  care"  bit  positions  in  the  reference 
patterns  that  must  not  affect  the  matching  operation;  if  all 
other  bits  in  a  reference  pattern  match  the  input  data 
pattern,  then  a  match  must  be  signalled  regardless  of  the 
state  of  the  input  bits  in  the  "don't  care*  positions.  This 
can  be  accomplished  with  the  Exclusive  Or  processor  by 
modification  of  the  detector  array.  Detector  elements  in 
"don't  care"  positions  should  behave  as  if  no  light  is 
falling  on  them,  independently  of  the  presence  or  absence  of 
light.  Placing  a  mask,  opaque  at  the  positions  of  the 
"don't  care"  bits,  directly  in  front  of  the  detector  array 
is  a  simple  way  to  achieve  this. 

Hand  ftQcsssiag 

The  Nand  form  of  the  processor  is  capable  of  more 
compact  and  powerful  optical  numerical  processing.  A 
schematic  representation  of  the  optical  system  used  for 
recording  its  holograms  is  shown  in  Figure  3a.  The  data 
mask  in  this  case  is  a  single  row  of  elements.  Each  element 
can  be  transparent  or  opaque  and  can  shift  the  phase  of 
light  by  either  0  or  180  degrees.  For  each  output  bit  of 
the  system  there  is  a  row  of  positions  that  the  reference 
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PIOUAE  1  “NANO"  CASCO  PROCESSOR.  (•)  TRUTH  TACLE  OATA  HOLOGRAM  RECORDING. 

(bl  EXAMPLE  OP  OATA  PROCESSING  WITH  AN  INPUT  PATTERN  THAT  MATCHES 
ONE  RECORDED  PATTERN. 


bean  steps  along  for  hologran  recording.  At  each  position  a 
different  pattern  of  input  bits  that  cause  the  output  bit  to 
be  a  one  is  used  t?  control  the  amplitude  and  phase 
properties  of  the  mask.  The  details  of  this  will  be 
explained  in  a  subsequent  paragraph.  Bolograns  are  recorded 
in  the  electrooptic  crystal  by  the  interference  of  the 
reference  bean  and  the  object  beam.  For  numerical 
processing  of  input  data,  only  the  object  beam  is  used.  The 
input  data  mask  is  of  the  same  form  as  the  mask  used  for 
hologram  recording,  but  need  not  have  a  phase  shifting 
capability.  The  mask  elements  are  transparent  or  opaque  in 
response  to  the  ores  and  zeros  of  the  input  data  word.  As  a 
result  of  the  way  the  holograms  are  recorded,  light  from  the 
input  mask  combines  to  form  optical  Hand  operations  at  the 
detector  plane.  Only  if  the  input  data  match  a  recorded 
pattern  will  a  dark  position  occur  at  the  detects.  The 
detector  elements  can  be  connected  so  that  a  single  darkened 
element  provides  an  indication  that  the  output  bit  is  a 
logical  one.  Figure  3b  shows  an  example  of  data  processing 
with  the  input  data  matching  one  recorded  pattern. 

An  example  of  determining  the  state  of  the  input  mask 
from  a  truth-table  entry  is  shown  in  Figure  4a.  The  input 
ma»k  has  one  element  more  than  there  are  input  bits  to  the 
numerical  operation.  This  extra  element  is  called  the 
reference  bit;  it  is  always  transparent  and  its  phase  is 
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FIGURE  4.  (»>  EXAMPLE  PHASOR  OIAGRAM  FOR  HOLOGRAM  RECOROiNG  IN  NANO-GASED 
NUMERICAL  OPTICAL  PROCESSING,  (b)  PHASOR  DIAGRAM  SHOWING  POSSIBLE 
RESULTANT  AMPLITUDES  AT  A  PARTICULAR  OETECTOR.  NUMBERS  INDICATE 
DEGENERACY  OF  PHASOR. 


defined  to  be  zero  degrees.  Elements  in  the  mask 
corresponding  to  'don't  care"  positions  in  the  input  word 
•re  made  opaque.  Elements  that  represent  ones  are 
transparent  and  are  shifted  in  phase  by  180  degrees, 
elements  that  represent  zeros  are  transparent  and  have  a 
phase  of  zero  degrees.  The  hologram  of  each  of  the  bit 
positions  is  recorded  in  the  electrooptic  crystal  by 
interference  with  the  reference  beam.  All  holograms  are 
recorded  to  the  same  diffraction  efficiency  except  the 
reference  bit;  it  is  recorded  to  H  times  the  amplitude 
efficiency  of  the  others,  where  N  is  the  number  of  ones  in 
the  input  word.  This  can  be  done  by  passing  more  intense 
light  tnrough  the  reference  location,  or  by  recording  it  for 
a  longer  period  of  time.  When  holograms  for  all  the 
required  truth-table  patterns  have  been  recorded,  the 
processor  is  complete  and  data  processing  may  begin.  Light 
from  the  transparent  locations  of  the  data  mask  reconstructs 
a  subset  of  the  recorded  holograms.  The  light  reaching  any 
detector  element  will  have  one  of  a  number  of  possible 
amplitude  states,  as  shown  for  this  example  in  Figure  4b. 
But  complete  destructive  interference  will  occur  to  produce 
darkness  at  a  detector  element  only  if  the  input  data 
pattern  matches  the  corresponding  recorded  pattern. 
Conceptually,  the  recorded  holograms  may  be  thought  of  as 
’  ‘-'•mining  the  connection  of  input  bit  positions  to  an 
optical  Nar.d  operation.  Positions  it  are  zeros  in  the 
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recorded  pattacn  are  connected  in  complemented  form,  ones 
•re  connected  in  uncomplemented  form.  Determination  of  the 
state  of  the  input  compoaer  is  shown  for  an  example  in 
Figure  S.  The  logical  operation  performed  in  parallel  by 
the  Nand  processor  on  the  input  bits  is  given  in  Equation  2. 
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wheret 

O  is  the  output  hit 
X.  is  the  kth  bit  of  the  input  data 
Pr.  is  the  kth  bit  of  jth  recorded  hologram  pattern 
a^is  the  nuober  of  bits  in  the  input  data  words 
a  is  the  number  of  holograms  recorded  for  the  output 
bit 

♦  is  the  Identity  function 

*  la  the  optically  performed  Nand  operation 

a  In  the  And  operation  performed  by  the  detector. 


Parallel  processing 

The  explanations  given  for  the  operation  of  both 
processing  systems  dealt  with  only  a  single  output  bit. 
Practical  parallel  operations  require  simultaneous 
production  of  many  output  words,  each  composed  of  multiple 
bits.  The  principles  of  parallel  operation  are  essentially 
the  same  for  both  forms  of  the  processor.  Page  composer 
locations  for  bits  in  the  same  output  word  should  be 
arranged  along  the  plane  of  the  hologram  recording  beams. 
For  the  sake  of  discussion,  and  in  the  following  figures, 
this  is  assumed  to  be  the  horizontal  plane.  A  single  set  of 


FIGURE  5.  DIAGRAM  OF  THE  CONCEPTUAL  WAY  RECORDED  HOLOGRAMS 
DETERMINE  THE  CONNECTION  OF  INPUT  BITS  TO  THE  OPTICAL 
"NAND"  OPERATION.  (•)  LOGIC  RECORDED  HOLOGRAPHICALLY 
FOR  TRUTH-TABLE  ENTRY  OF  1010.  lb)  RESPONSE  OF  SYSTEM 
TO  A  NON-MATCHING  INPUT,  (c)  RESPONSE  OF  SYSTEM  TO  A 
MATCHING  INPUT. 
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holograms  nay  then  be  used  to  process  Multiple  sets  of  input 
date  stacked  in  the  perpendicular  (vertical)  direction. 
Slaultaneous  processing  of  Multiple  sets  of  data  is  possible 
because  holograns  display  little  angular  selectivity  for 
light  displaced  along  the  direction  perpendicular  to  the 
plane  of  the  recording  beans.  Diagrams  indicating  the 
recommended  arrangements  are  shown  for  the  Exclusive  Or 
processor  in  Figure  6,  and  for  the  Nand  processor  in 
Figure  7.  in  both  figures,  the  Fourier  transform  lenses 
have  been  omitted  to  simplify  the  diagrams.  A  theoretical 
analysis  of  the  reconstruction  of  data  page  holograms  with 
displaced  beams  (24]  has  provided  convincing  support  of  the 
feasibility  of  these  configurations. 

Even  this  degree  of  parallelism  makes  use  of  only  a 
fraction  of  the  capability  available.  For  the  Exclusive  Or 
processor,  it  is  conceptually  simple  to  record  reference 
patterns  for  different  operations  with  the  reference  beam  at 
different  angles.  Then  different  processing  operations  may 
be  achieved  by  altering  the  angle  of  the  reference  beam 
during  readout.  For  both  forms  of  processing,  only  a  single 
physical  location  in  the  crystal  has  been  considered  so  far. 
If  the  size  of  the  data  maBk  Fourier  transform  at  the 
crystal  will  allow,  several  independent  sets  of  holograms 
can  be  recorded  at  different  locations  in  the  crystal. 
Processing  at  all  these  locations  could  be  carried  out  in 
parallel. 
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PlieuMion  fi£  PrOCefifllDfl  Systems 

The  differences  apparent  between  the  two  forms  of 
processing  discussed  suit  them  to  different  types  of  data 
processing  applications.  The  principle  strength  of  the 
Exclusive  Or  processor  is  that  it  makes  available  at  the 
detector  array  the  bit-by-bit  comparison  of  the  input  data 
with  the  stored  reference  patterns.  This  makes  it  ideal  for 
a  word/signature  detection  operation.  Fabrication  time  is 
short  since  all  reference  patterns  for  a  given  operation  are 
recorded  as  a  single  hologram.  Also,  no  preprocessing  of 
reference  data  patterns  must  be  performed.  Another  useful 
feature  is  that  many  sets  of  reference  patterns  can  be 
angularly  multiplexed  into  the  same  thick  recording  medium. 
The  operation  performed  by  the  processor  then  can  be  changed 
simply  by  changing  the  angle  of  the  reference  beam. 

The  advantages  of  the  Hand  processor  are  that  for  the 
same  numerical  operation,  it  requires  a  smaller  input  page 
composer  and  a  smaller  detector  array  thrn  the  Exclusive  Or 
processor.  This  makes  it  possible  to  process  more  channels 
of  input  data  simultaneously.  This  advantage  makes  the  Nand 
processor  the  appropriate  choice  for  most  demanding  parallel 
numerical  computation  applications.  A  further  advantage  is 
that  only  one  beam  of  input  light  is  used  for  processing. 
Thus  concerns  about  the  relative  amplitude  and  phase  of 
multiple  beams  are  eliminated  when  the  processor  is  used. 
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Although  such  concerns  do  exist  during  fabrication  of  the 
processor ,  it  is  generally  much  easier  to  control  the 
fabrication  environaent  than  the  application  environment. 

Though  this  thesis  is  not  intended  to  specify  the 
details  of  construction  for  a  fully  operational  processor , 
certain  examples  of  possible  implementations  for  various 
components  have  been  used  to  guide  development  of  the 
concepts.  The  actual  data  processing  that  occurs  requires 
only  the  time  needed  for  light  to  pass  from  the  input  plane 
to  the  output  plane.  In  fact,  processor  operation  could  be 
pipelined  by  presenting  data  to  the  system  at  intervals 
determined  only  by  optical  path  length  differences  through 
the  system.  Thus  the  operating  speed  of  any  practical 
system  will  be  set  by  the  cycle  rate  of  the  input  and  output 
devices.  The  fastest  modulators  currently  available  are 
based  on  the  electrooptic  effect.  They  are  capable  of 
switching  in  10~6  to  10 "8  seconds.  Although  suitable  arrays 
of  such  modulators  do  not  currently  exist,  there  is  no 
conceptual  reason  that  precludes  their  production.  Optical 
detectors  that  operate  in  the  range  of  10~8  seconds  are 
currently  available,  so  they  should  not  be  the  limiting 
factor.  Prom  considerations  of  the  number  of  reference 
patterns  recorded  that  are  presented  in  Chapter  III,  the 
number  of  parallel  channels  of  data  that  might  be  processed 
ranges  from  100  for  Exclusive  Or  processing  to  1000  for  Nand 
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processing.  Therefore,  a  processing  rate  of  10  to  10 
operations  per  second  nay  be  projected.  The  operation 
implenented  nay  be  siaple  addition  or  multiplication,  or  it 
could  be  evaluation  of  a.:  entire  polynomial  including 
several  additions  and  multiplications.  Other  presentations 
of  Exclusive  Or  and  Nand  optical  processing  can  be  found  in 
References  25,  26,  and  27. 
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CHAPTER  IZI 

TROTH -TABLE  IMgQBMAHfiH  STORAGE 

Effect  on  Eiacefifilng  Systems 
Both  the  Exclusive  Or  and  the  Nand  processing  systems 
operate  on  the  principle  of  truth-table  ltfbk-up.  The 
digital  operation  to  be  implemented  is  represented  by  a 
truth-table  relating  output  variables  to  input  variables. 
Information  culled  from  this  truth-table  is  stored  in  the 
processing  system  in  holographic  form.  Generally,  the 
greater  the  number  of  input  variables  in  the  chosen 
operation,  the  larger  the  truth-table  will  be.  In 
particular,  output  variables  that  depend  on  ,  many  input 
variables  will  require  large  truth-tables.  Por  Exclusive  Or 
processing  the  number  of  reference  patterns  recorded  affects 
the  size  of  the  input  page  composer  and  the  output  detector 
array.  For  Nand  processing  the  number  of  reference  patterns 
affects  the  number  of  holograms  that  must  be  recorded. 
References  28  and  29  deal  with  the  relationship  between 
truth-table  information  storage  and  the  optical  processing 
systems. 

In  either  processing  system,  the  capacity  available 
to  store  truth-table  information  is  limited.  The 
information  storage  capacity  of  thick  holograms  recorded  in 
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•lectrooptic  crystals  has  been  extensively  investigated  (30, 
31],  It  is  important  to  determine  the  storage  capacity 
required  to  implement  useful  operations,  and  to  investigate 
methods  of  minimizing  the  truth-table  size  for  a  selected 
operation.  The  information  obtained  is  useful  not  only  for 
the  optical  processing  systems  currently  under 
investigation,  but  for  any  direct  logic  implementation  of 
digital  processing,  regardless  of  the  technology  used. 
Design  of  very- large-scale  integration  (VLSI)  circuitry  and 
the  use  of  Programmable-Logic  Arrays  (PLA)  has  generated 
renewed  interest  in  truth-table  minimization. 

Two  methods,  which  can  be  used  in  combination,  have 
been  used  to  produce  substantial  reductions  in  truth-table 
size:  logical  reduction  of  truth-table  representations,  and 
implementation  of  digital  operations  in  a  residue  number 
system.  Both  methods  will  be  presented  in  detail  in  the 
following  sections. 


Trufch-Tahle  Reduction 

Before  considering  truth-table  reduction  methods  and 
effects,  the  general  strategy  for  truth-table  look-up 
processing  will  be  reviewed.  In  a  conventional  truth-table 
with  N  input  variables  there  will  be  2N  entries.  Each  entry 
specifies  the  state  (one  or  zero)  of  the  output  bit  for  a 
different  pattern  of  the  input  bits.  The  present  analysis 
assumes  multiple-output  systems  are  constructed  by  grouping 
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independent  single  output  systems.  This  is  not  a  necessary 
assumpt.^n,  but  is  a  practical  cne  for  the  processors 
considered.  For  a  digital  operation  that  has  M  bits  in  the 
output?  H  independent  truth-tables  will  be  stored. 
Truth-table  look-up  processing  can  be  implemented  by  storing 
the  entire  truth-table  for  each  output  bit  of  a  digital 
operation.  A  set  of  N  input  bits  is  grouped  to  represent 
two  (N/2)-bit  input  numbers,  the  output  bits  represent  the 
result  of  the  digital  operation,  for  example  the  sum  or 
product  of  the  input  numbers.  When  data  is  presented  to  be 
processed,  the  truth-tables  are  searched  for  the  recorded 
input  data  pattern  that  matches  the  set  to  be  processed. 

(  The  corresponding  output  bits  from  each  of  the  tables  form 

the  result  of  the  operation.  An  example  truth-table  with 
three  input  bits  and  two  output  bits  is  shown  in  Figure  8a. 

Since  the  output  bits  can  assume  only  one  of  two 

states,  a  significant  reduction  in  the  amount  of  'ruth-table 
information  stored  is  possible  by  including  only  those  input 
combinations  that  result  in  an  output  of  one,  or  only  those 
that  result  in  zero.  These  are  referred  to  as  the 

unity-result  and  the  null-result  truth-tables  respectively. 
For  all  the  truth-tables  in  this  study,  significantly  less 
than  half  of  the  ouput  bits  were  ones,  so  only  input 

combinations  producing  an  output  of  one  were  included. 
Examples  of  unity-result  truth-tables  are  given  in 
Figure  8b.  Stored  tables  are  searched  for  the  current 
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pattern  of  input  bits.  If  it  is  found,  the  output  for  that 
table  is  a  one,  otherwise  the  output  is  zero. 

Further  reduction  of  truth-table  information  can  be 
achieved  by  storing  only  minimal  prime  implicant  coverings 
of  the  unity-result  (or  null-result,  truth-table.  The 
current  input  pattern  is  presented  to  ach  of  the  stored 
prime  implicants.  If  any  of  them  prrojces  a  match,  then  the 
output  for  that  truth-table  is  a  one.  The  only  difference 
between  finding  matching  patterns  in  a  unity-result 
truth-table  and  in  a  prime  implicant  table  is  that  certain 
input  bit  positions  may  be  ignored  when  matching  prime 
implicants.  These  bits  vary  in  number  and  position  from 
implicant  to  implicant.  Both  forms  of  the  optical 
processing  system  can  work  with  unity-result  truth-tables 
and  with  prime  implicant  tables.  A  simple  example  of 
truth-cable  reduction  is  shown  in  Figure  8c  through  8e. 

The  process  of  determining  a  minimal  prime  implicant 
covering  from  a  unity  result  truth-table  has  been  studied 
extensively  (32,33).  Two  common  algorithms  used  are  the 
Karnaugh  map  and  the  Quine-McCluskey  method.  The  former  is 
a  graphical  method  and  proves  impractical  for  functions  of 
more  than  five  or  six  input  variables.  The  latter  is  able 
to  handle  any  number  of  input  variables,  but  when  programmed 
on  a  computer  it  is  quite  inefficient  in  terms  of  execution 
time  and  required  memory.  To  determine  the  feasibility  of 
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implementing  useful  digital  operations  on  the  processing 
systems  the  amount  of  reduced  truth-table  information  for 
those  operations  must  be  known.  Obtaining  this  information 
necessitated  the  reduction  of  very  large  truth-tables,  up  to 
sixteen  input  variables  in  some  cases.  Several  algorithms 
of  increasing  sophistication  were  used  to  reduce 
truth-tables.  The  process  of  truth-table  reduction 
separates  into  four  tasks:  1)  create  the  complete 
truth-table  for  the  desired  operation,  2)  find  the  prime 
implicants  of  the  truth-table,  3)  build  a  "table  of  choice" 
indicating  which  prime  implicants  cover  which  entries  in  the 
initial  truth-table,  and  4)  construct  a  minimal  prime 
implicant  covering  from  the  table  of  choice.  The  first  and 
the  third  processes  are  straightforward  and  consume 
relatively  little  effort.  The  second  and  the  fourth 
processes  become  very  time-comsuming  even  for  a  small  number 
of  input  variables. 

The  process  of  finding  the  prime  implicants  for  a 
truth-table  was  initially  programmed  as  the  Quine-McCluskey 
algorithm.  The  inefficiencies  in  this  algorithm  were 
apparent,  and  the  more  efficient  Tison  algorithm  [33]  was 
substituted.  The  Tison  algorithm  served  well  until  results 
for  binary  multiplication  for  two  numbers  of  more  than  four 
bits  each  were  needed.  Beyond  this  point  the  computer 
execution  time  and  the  memory  required  became  prohibitive. 
A  search  was  conducted  for  a  "state-of-the-art"  algorithm 
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and  a  tree  structured  approach  by  Morreale  [34]  was  adopted. 
This  algorithm  eliminates  much  of  the  redundancy  inherent  in 
the  other  algorithms  and  substantially  reduces  execution 
time  and  memory  required.  In  the  course  of  programming  this 
algorithm,  it  was  determined  that  time  and  memory  could  be 
further  conserved  for  the  problems  at  hand  by  modifying  the 
algorithm.  The  algorithm,  as  presented  by  Morreale, 
operates  on  the  input  bit  patterns  of  the  unity-result 
truth-table.  Because  the  unity  result  patterns  are  a  subset 
of  all  possible  input  patterns,  each  comparison  step  in  the 
algorithm  required  a  search  of  the  stored  patterns  to 
determine  if  the  desired  pattern  was  present.  It  is  more 
direct  to  operate  instead  on  a  vector  containing  the  state 
of  the  output  bit  for  all  input  combinations  arranged  in 
numerical  order.  The  search  step  in  the  original  algorithm 
then  becomes  a  simple  indexing  into  the  vector. 
Furthermore,  while  the  original  algorithm  must  store  only  a 
subset  of  all  possible  entries,  each  entry  that  is  stored 
includes  N  bit  positions.  The  algorithm  as  modified  stores 
an  entry  tor  every  possible  input  combination,  but  each 
entry  is  only  one  bit  position.  Therefore,  if  the  ratio  of 
unity-result  entries  to  total  truth-table  entries  is  greater 
than  1/N,  which  is  most  often  the  case,  then  the  modified 
algorithm  also  requires  less  memory.  The  modified  algorithm 
was  programmed  and  proved  to  be  very  satisfactory  for  all 
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cases  considered. 

The  problem  o£  finding  a  minimal  prime  implicant 
covering  of  the  table  of  choice  proved  to  be  less  yielding 
to  efforts  to  streamline  its  implementation.  Originally,  an 
algorithm  that  produced  substantial  reduction  of  the  table 
of  choice,  but  did  not  guarantee  a  minimal  result,  was  used. 
However,  to  produce  authoritative  results,  such  a  guarantee 
was  required.  The  tabular  method  using  recursive  branch  and 
bound  for  cyclic  tables  presented  by  Muroga  [33]  was 
adopted.  A  large  effort  went  into  optimizing  this 
algorithm's  use  of  computer  time  and  memory  and  into 
searching  for  a  more  efficient  algorithm  to  replace  it. 
Eventually,  however,  the  algorithm  itself  proved  to  be  the 
limiting  factor  in  the  size  of  reduction  problems  that  could 
be  solved  exactly. 

All  truth-table  reduction  programming  mentioned  so 
far  was  done  in  APL  on  the  CDC  Cyber  system.  APL,  with  its 
powerful  intrinsic  array  handling  operations,  is  a  natural 
and  common  choice  for  problems  of  this  type.  Further,  APL 
can  pack  logical  variables  as  single  bits  in  the  physical 
computer  memory.  However,  system  support  for  APL  is 
limited;  very  few  individuals  are  available  to  offer  advice 
on  programming  problems,  and  external  routines  such  as 
virtual  memory  are  unavailable.  Thus,  when  APL  programs 
could  not  deal  with  the  size  of  the  reduction  problems 
attempted,  the  programs  were  rewritten  in  Fortran  IV. 
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Special  subroutines  were  written  to  allow  Fortran  to  pack 
logical  variables  one  per  memory  bit.  This  considerably 
relieved  memory  size  problems,  but  execution  time  continued 
to  be  quite  long.  For  example,  calculation  of  the  size  of 
the  truth-table  for  the  fifth  most  significant  bit  of  the 
multiplication  result  of  two  six  bit  binary  words  required 
33.7  thousand  seconds  of  execution  time  on  the  Cyber 
computer. 

Humber  Systems 

The  ability  of  truth-table  look-up  processing  to 
implement  any  digital  function  provides  flexibility  in 
choosing  how  particular  numerical  operations  are  to  be 
accomplished.  In  particular,  the  number  system 
representation  to  be  used  may  be  a  system  design  parameter. 
Within  the  framework  of  binary  logic,  there  are  many  ways  to 
represent  a  number.  The  most  common  way  for  processing 
applications  is  the  base  two  fixed  radix  system,  usually 
called  the  binary  number  system  (BNS).  As  a  number  system 
with  broad  support  and  known  computational  properties,  the 
BNS  is  an  obvious  choice.  However,  it  sets  a  number  of 
impediments  in  the  path  of  high  6peed  data  processing. 
Chief  among  these  is  a  property  it  shares  with  all  fixed 
radix  systems,  the  interdependence  of  digit  results  in 
numerical  operations,  e.g.  the  need  for  carry  propagation 
in  addition  and  multiplication.  In  electronic  digital  logic 
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implementations  this  requires  that  the  most  significant  bits 
of  a  result  cannot  be  known  until  calculation  of  all  less 
significant  bits  has  been  completed.  In  truth-table  look-up 
processor  operation  it  has  another  undesirable  effect. 
Recall  that  the  size  of  the  truth-table  stored  for  an  output 
bit  increases  as  the  number  of  inputs  affecting  that  output 
increases.  Because  output  bits  in  the  binary  number  system 
depend  on  all  less  significant  input  bits,  the  corresponding 
truth-tables  can  be  enormous.  It  would  be  desirable  then, 
if  a  number  system  representation  with  little  or  no 
interdigit  dependence  could  be  used. 

Residue  number  systems  have  just  that  property.  The 
residue  representation  of  a  number  consists  of  a  group  of 
digits  corresponding  to  the  remainders  left  when  the  number 
is  divided  in  turn  by  each  member  of  a  chosen  set  of  moduli. 
The  properties  of  residue  number  systems  have  been  studied 
for  many  years  {35,36,37],  but  many  aspects  are  still  not 
completely  understood.  Among  the  known  facts  are:  1)  If  the 
chosen  set  of  moduli  are  relatively  prime  (no  two  contain  a 
common  factor),  the  range  of  numbers  that  can  be  uniquely 
represented  is  equal  to  the  product  of  the  moduli.  2)  The 
operations  of  addition,  subtraction,  and  multiplication  on 
residue  numbers  proceed  independently,  digit  by  digit. 
3)  So  long  as  it  can  be  guaranteed  that  the  final  result  of 
a  multiple  step  calculation  is  within  the  valid 
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representation  range,  overflow  of  that  range  for 
intermediate  results  does  not  affect  the  final  result.  The 
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of 
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digits  provides 
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reduction 

in 

the 
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operation  of  division  in  a  residue  system  requires 
interdigit  dependence.  Also,  the  conversion  between  a 
residue  representation  and  more  common  representations  is 
not  easily  accomplished.  Nevertheless,  it  is  a  worthy 
candidate  for  comparison  with  the  binary  number  system.  In 
a  residue  representation  a  digit  determined  by  a  modulus  H 
may  take  on  any  value  from  0  to  M-l.  To  make  the  residue 
number  system  compatible  to  use  with  binary  logic,  a  Binary 
Coded  Residue  (BCR)  representation  was  introduced.  That  is, 
the  BNS  representation  of  individual  residue  digits  is  used. 
Therefore  dependence  does  exist  between  bit  positions  within 
each  individual  residue  digit.  But  a  typical  residue  digit 
might  require  only  four  bits  to  represent  it,  whereas  the 
entire  result  could  require  thirty-two  or  more  bits.  So 
interdigit  dependence  is  kept  quite  localized.  A  brief 
review  of  residue  arithmetic  is  given  in  Pigure  9. 

Results  Ql  Hath -Table  Reduction  Computet  study 
A  comparative  study  of  the  amount  of  required 
truth-table  information  was  made  for  eight  categories 
representing  all  possible  combinations  of  three  parameters: 
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representation,  BNS  or  BCR;  numeric  operation,  addition  or 
multiplication;  and  compaction,  unity-result  truth-table  or 
minimal  prime  implicant  covering.  The  results  are  presented 
graphically  in  Pigure  10,  detailed  tabulated  results  are 
available  in  Appendix  1.  In  Pigure  10  the  number  of 
truth-table  entries  is  plotted  against  the  number  of 
variables  in  each  input  word.  The  total  number  of  variables 
is  twice  that  number.  It  is  apparent  that,  for  all  but  the 
shortest  word  lengths,  use  of  the  residue  representation 
produces  a  substantial  reduction  in  the  number  of  required 
truth-table  entries.  Logical  reduction  of  truth-tables 
produces  a  significant  improvement  in  all  cases,  but  is  most 
effective  for  results  in  the  BNS.  Addition  requires  fewer 
truth-table  entries  than  multiplication  for  all  cases  except 
unreduced  binary.  All  curves  grow  at  nearly  an  exponential 
rate,  but  curves  for  the  BCR  have  a  much  smaller  slope  than 
curves  for  the  BNS.  Detailed  analysis  of  issues  related  to 
these  truth-table  reduction  results  can  be  found  in 
References  38  and  39. 

Comparison  al  Results  with  Practical  Limits 
As  noted  in  a  previous  section,  the  number  of 
truth-table  entries  to  be  recorded  places  different  demands 
on  the  Exclusive  Or  processor  and  the  Nand  processor.  Por 
the  Exclusive  Or  processor  the  number  of  truth-table  entries 
translates  directly  into  the  number  of  rows  required  in  tne 


>*•  * 


S-7Z 


page  composer.  Of  course ,  the  rows  need  not  be  in  a  single 
vertical  column  but  may  be  folded  into  a  more  convenient 
square  format.  Plans  currently  exist  to  produce  page 
composers  with  dimensions  of  512  by  512  elements  [40].  If, 
for  example,  a  digital  operation  with  16  input  variables  is 
to  be  implemented,  then  512x512/16  or  over  16,000 
truth-table  entries  could  be  accomodated.  This  would  be 
sufficient  for  most  of  the  cases  presented  in  Figure  10. 
The  effect  of  using  such  a  large  number  of  entries  on  the 
system's  ability  to  handle  parallel  data  streams  would  be 
another  design  consideration. 

For  Nand  processing,  truth-table  entries  translate 
into  numbers  of  holograms  recorded  at  one  spatial  location 
in  the  crystal.  Previous  work  has  demonstrated  recording 
525  holograms  at  one  location  [30],  If  the  practical  limit 
is  assumed  to  be  in  the  range  of  500  to  1000,  then 
operations  in  the  BCR  system  are  possible  for  up  to  eight 
bits  of  precision.  This  is  somewhat  less  than  for  the 
Exclusive  Or  processor,  but  there  is  much  less  of  a 
trade-off  between  the  number  of  recorded  truth-table  entries 
and  the  number  of  parallel  data  streams  for  the  Nand 
processor . 

One  final  consideration  is  that  these  comparisons 
apply  to  operations  that  can  be  accomplished  in  a  single 
pass  through  the  optical  system.  If  more  precision  is 
desired,  it  is  possible  to  complete  a  result  by  combining 
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intermediate  results  cl  less  precision.  For  instance, 
bit  addition  can  be  broken  into  two  eight  bit  additions, 
a  sixteen  bit  multiplication  into  four  eight 
multiplications  and  three  additions. 
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CHAPTER  IV 


The  equipment  used  to  conduct  the  experiments  may  be 
classified  into  three  categories:  the  optical  system,  the 
video  system,  and  the  computer  system.  Each  of  these  will 
be  described  in  turn. 


Zhfi  Optical  System 

The  Exclusive  Or  processing  experiments  and  the  Hand 
processing  experiments  had  a  large  part  of  the  arrangement 
of  optical  components  in  common.  Where  arrangements  for  the 
experiments  differed,  the  differences  will  be  specifically 
mentioned.  Also,  for  both  sets  of  experiments,  the 
arrangement  of  optical  components  evolved  so  a3  to  produce 
the  best  results  possible.  The  arrangements  described  below 
are  the  initial  basic  configurations.  Additions  and 
modifications  that  were  made,  and  the  reasons  fee  '..hem,  will 
be  explained  in  Chapter  V  in  the  description  of  the 
experiments.  Figure  11  shows  the  optical  configuration  for 
Exclusive  Or  processing  experiments,  and  Figure  12  shows  the 
configuration  for  Hand  processing  experiments. 

All  components  of  the  optical  system  were  mounted  on 
an  air-suspension  vibration-isolation  table.  A  box  to  cover 
the  table  was  constructed  of  foamcore  sheets.  This  box,  by 


Al: 

AffOftLMr 

SS: 

lunSFiiw 

CA..CAJ. 

CmwmiiM  lnnn>>ty  Anwimw 

C. 

Elpcvo-epnc  Cryttp! 

I  A: 

ENcnamoHy  Canuoliad  Amplitude  ModwIMor 

EF: 

CNcvomcMIy  CortreOpd  Mm  Modutow 

IS,.  ESj.ES,: 

Eltcttemullv  ChwAM  SMMI 

Idcmwi,  lam 

Mmhn  L«w» 

Fkimr  Tmlom  Lanw 

M: 

OtnMmk 

O: 

Oiwi  !■»  NR 

F: 

PmhoK  ApprtuM 

FR: 

FaNrutixn  Rsutw 

R: 

Rttepnca  Imp  FmR 

V; 

VdnCuwi 

FIGURE  11.  OPTICAL  EXPERIMENTAL  SYSTEM  FOR  'EXCLUSIVE  OR"  EXPERIMENTS. 
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enclosing  all  the  optical  elements,  except  the  laser  head, 
reduced  thu  effect  of  air  currents  and  thermal  gradients  on 
the  experiments.  It  did,  however,  contribute  to  the  need  to 
have  the  phase  and  amplitude  of  the  reference  beam  under 
electrical  control,  as  discussed  below.  The  principal  light 
source  used  for  the  experiments  was  an  argon  ion  laser  tuned 
to  operate  at  a  freespace  wavelength  of  514. 5  nanometers. 
An  intracavity  temperature  stabilized  etalon  was  used  to 
insure  spectral  purity  of  the  laser  output.  The  width  of 
the  beam  at  the  laser  output  was  about  2.0  mm.  A  crystal 
polarization  rotator  was  used  to  change  the  vertically 
polarized  laser  output  to  horizontal  polarization.  This 
polarization,  combined  with  the  horizontal  angular 
displacement  of  the  object  and  reference  beams,  and  the 
orientation  of  the  lithium  niobate  crystalline  axis,  is  the 
most  efficient  configuration  for  hologram  recording.  A  50% 
beam  splitter  was  used  to  separate  the  object  and  reference 
beams.  Electronically  controlled  shutters  were  located  at 
the  output  of  the  laser  and  in  the  object  and  reference 
beams.  Precise  timers  in  the  shutter  controllers  were  used 
to  regulate  the  exposure  time  for  recording  holograms. 

The  reference  beam  passed  through  a  compensated 
variable  attenuator  and  tnen  to  electrically  controllable 
amplitude  and  phase  modulators.  The  amplitude  modulator 
was  assembled  from  a  pair  of  Glan-Thompson  crystal 
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polarizers  on  either  side  of  an  electrooptic  KD*P 
polarization  rotator.  The  first  polarizer  was  used  to 
improve  the  lOOtl  polarization  ratio  of  the  laser  output 
light.  The  voltage  to  drive  the  polarization  rotator  was 
provided  by  a  high-voltage  power  supply  that  could  be 
controlled  manually  or  by  computer.  Computer  control  was 
especially  desirable,  since  the  program  could  automatically 
account  for  the  sinusoidal  dependence  of  modulation  on 
voltage.  Because  the  voltage  range  of  the  power  supply  was 
not  sufficient  to  produce  a  full  90  degree  rotation  of  the 
light  polarization,  the  second  polarizer  was  mounted  t>r>  that 
its  axis  could  be  rotated.  Thus  by  inserting  a  quarter-  or 
half-wave  plate  it  was  possible  to  control  electrically  the 
modulator  about  its  transmission  null,  peak,  or  half 
intensity  point.  The  single  element  phase  modulator  was  a 
KD*P  crystal.  Another  computer  controlled  power  supply 
provided  sufficient  voltage  to  drive  this  modulator  over  a 
full  180  degree  range.  From  the  phase  modulator  the 
reference  beam  proceeded  to  the  position  of  the  lithium 
niobate  crystal. 

The  diameter  of  the  object  beam  was  expanded  by  a 
focusing  lens  followed  by  a  collimating  lens.  Their  focal 
lengths  were  12.8  mm  and  380  mm  respectively,  giving  a  beam 
expansion  ratio  of  29.7,  and  an  output  beam  diameter  of 
59.4  mm.  At  the  focal  point  of  this  beam  expansion  system  a 
spatial  filter  with  a  15  micron  aperture  was  used  to 


eliminate  unwanted  transverse  modes.  The  expanded  object 
beam  passed  through  a  data  mask,  which  will  be  described  in 
detail  below.  A  Fourier  transform  lens  was  placed  at  its 
focal  length,  380  mm,  beyond  the  data  mask.  Another  focal 
length  beyond  the  lens  was  the  position  of  tne  lithium 
niobate  crystal. 

For  the  Exclusive  Or  processing  experiments  the  angle 
between  the  object  and  reference  beams  was  fixed  at  30 
degrees.  However,  for  the  Nand  processing  experiments  a 
series  of  angularly  displaced  reference  beam  positions  were 
used.  This  was  accomplished  by  interposing  two  mirrors  in 
the  reference  beam  path  between  the  phase  shifter  and  the 
lithium  niobate  crystal. 

The  lithium  niobate  crystal  used  for  the  experiments 
was  a  right  parallelepiped  measuring  10  mm  by  10  mm  by 
2.0  mm.  The  square  faces  were  polished  to  optical  flatness. 
The  crystal  was  Y-cut  and  thus  the  C  axis  is  parallel  to  the 
polished  faces.  The  crystal  was  mounted  with  the  C  axis 
horizontal  and  directed  to  the  right  when  viewed  along  the 
direction  of  propagation  of  the  beams.  The  C  axis  faces 
were  left  open-circuited.  The  crystal  was  held  in  a  stepper 
motor  driven  mount  that  could  be  translated  parallel  to  the 
C  axis  of  the  crystal  and  rotated  about  the  vertical  axis. 
Both  degrees  of  freedom  were  used  in  the  experiments  to 
multiplex  spatially  and  angularly  many  holographic 


recordings  into  one  crystal. 

To  maintain  aperture  size  and  position  uniformity, 
all  data  masks  used  for  the  Exclusive  Or  experiments  were 
based  on  a  chrome  master  mask.  This  chrome  master  was  a 
glass  plate  with  a  32  by  32  array  of  100  micron  diameter 
circular  chrome  dots  on  400  micron  centers.  The  masks  used 
for  the  experiments  were  glass  plates  with  a  high  resolution 
photographic  emulsion  on  one  side  that  underwent  a  two-step 
contact  exposure  process.  The  first  step  was  a  contact 
exposure  with  the  chrome  master.  If  the  plate  was  developed 
immediately  after  this  step  the  result  would  be  a  32  by  32 
array  of  transparent  apertures  on  an  opaque  background. 
Instead,  however,  a  second  contact  exposure  was  done  with  a 
film  mask,  photoreduced  from  a  hand-drafted  pattern.  The 
film  mask  shielded  selected  apertures  from  the  second 
exposure;  these  apertures  would  remain  transparent  upon 
development  of  the  plate.  Approximately  half  the  apertures 
in  the  masks  used  for  Exclusive  Or  processing  experiments 
were  transparent;  these  were  selected  on  a  random  basis 
using  a  computer  generated  pattern.  Photographs  of  the 
actual  mask  patterns  used  for  the  Exclusive  Or  processing 
experiments  are  shown  in  Figure  13.  Only  a  few  apertures 
near  the  center  of  the  array  were  left  transparent  on  the 
masks  for  the  Nand  processing  experiments.  These  were 
selected  on  the  basis  of  representative  reference  patterns. 
Because  lightwave  phase  was  important  in  both  forms  of 


FIGURE  13.  PHOTOGRAPHS  OF  DATA  MASKS  USED  FOR  "EXCLUSIVE  OR 
PROCESSING. 
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processing,  a  liquid  gate  was  used  to  compensate  for 
differences  between  the  masks  used.  This  would  not  be  a 
problem  in  an  actual  system  where  a  single,  electronically 
alterable,  mask  is  used. 

Beyond  the  position  of  the  lithium  niobate  crystal, 
the  experimental  configurations  for  the  two  forms  of 
processing  differed.  For  Exclusive  Or  processing  the  output 
data  occurs  along  the  path  of  the  object  beam;  for  Nand 
processing  it  occurs  along  the  path  of  the  reference  beam. 
For  the  Exclusive  Or  processing  experiments,  a  second 
Fourier  transform  lens,  identical  to  the  first,  was 
positioned  one  focal  length  beyond  the  lithium  niobate 
crystal,  along  the  object  beam  path.  A  vidicon  camera  tube 
was  placed  with  its  photocathode  face  one  focal  length 
beyond  the  lens.  This  allowed  direct  observation  of  the 
processor  output  with  the  video  system. 

The  arrangement  for  the  Nand  processing  experiments 
was  more  complicated.  There  were  three  design  objectives 
for  that  part  of  the  optical  system  following  the  crystal 
position  in  the  Nand  experiments.  One  was  to  be  able  to 
view  the  reference  beam  on  the  video  system  without 
realigning  components  when  the  beam  changed  angular 
positions.  This  was  important  since  output  data  bits  occur 
at  the  reference  beam  positions.  Zt  was  desirable  to  see 
all  bits  simultaneously.  A  corollary  to  this  was  that  the 


Image  of  the  reference  beam  should  sized  so  that  adjacent 
positions  did  not  overlap.  A  second  objective  was  that  it 
should  be  possible  to  view  the  object  beam  mask  with  the 
video  camera  so  that  different  masks  could  be  placed  in  the 
object  beam  and  aligned  with  previous  masks  by  using  the 
video  system.  The  final  objective  was  that  the  object  and 
reference  wavefronts  be  visible  simultaneously  on  the  video 
system  for  the  purpose  of  using  the  resulting  interference 
fringes  to  monitor  the  phase  of  the  beams  during  hologram 
recording.  This  meant  the  beams  had  to  be  brought  together 
at  a  very  small  angle  so  that  the  fringes  would  be  larger 
than  the  minimum  resolution  limit  of  the  camera. 

All  three  objectives  were  met  by  the  arrangement 
shown  in  Figure  12.  The  object  beam  path  differed  little 
from  that  used  for  the  Exclusive  Or  experiments.  A  second 
Pourier  transform  lens  was  used,  followed  by  a  lens  to  relay 
the  image  to  the  camera.  The  simultaneous  requirements  of 
controlling  reference  beam  spot  size  and  spot  separation 
made  design  of  the  reference  beam  path  quite  a  challenging 
problem.  The  fact  that  all  angularly  separated  reference 
beam  paths  passed  through  the  same  point  in  the  crystal,  yet 
the  reference  beam  itself  was  collimated,  meant  that 
regardless  of  the  lens  used,  the  reference  beam  would  focus 
before  the  paths  reconvergud.  The  solution  was  to  use  one 
lens  to  bring  the  reference  beam  to  a  focus  at  different 
points  in  a  plane  for  different  paths,  then  use  a  second 


lens  to  produce  a  demagnified  Image  of  that  plane.  This 
image  was  slightly  defocussed  to  give  the  reference  beam 
spots  an  appropriate  diameter  at  the  camera  plane.  The 
object  and  reference  beams  were  combined  with  a  beam 
splitter.  By  aligning  so  that  the  reference  spot  coincided 
with  one  of  the  object  beam  spots  at  the  beam  splitter  and 
at  the  camefaf  the  fringes  produced  were  sufficiently  large 
to  be  conveniently  monitored.  The  reference  beam  wavefront 
was  spherical  and  the  object  beam  wavefront  was  plane#  so 
the  fringes  produced  were  circular;  but  this  did  not  produce 
any  difficulty. 

Figure  14  is  a  photograph  portraying  the  optical 
equipment  used.  The  arrangement  pictured  is  essentially  the 
one  used  for  the  Nand  processing  experiments.  Items  may  be 
located  by  referring  to  the  schematic  in  Figure  12.  Shown 
are  the  argon  laser  and  its  power  supply#  the  liquid  gate# 
and  the  crystal  mount.  In  the  center  foreground  is  a 
scanning  spectrum  analyser#  not  used  for  the  principal 
experiments#  but  for  precise  measurement  of  the  frequency 
stability  of  the  laser. 

The  Video  System 

Tne  video  system  was  the  primary  tool  for  taking  data 
in  all  the  processing  experiments.  A  block  diagram  of  the 
video  system  is  given  in  Figure  15.  Synchronization  signals 
for  the  entire  video  system  were  generated  by  a  crystal 


controlled  oscillator.  The  line  rate  used  was  875 
horizontal  scans  per  video  frame.  Video  frames  consisted  of 
two  Interlaced  video  fields;  each  field  being  completed  in 
one  sixtieth  of  a  second.  The  length  of  the  vertical 
blanking  period  between  fields  varied  with  time,  but  the 
number  of  visible  scans  stayed  between  810  and  820.  The 
Sierra  Scientific  video  camera  had  rack-mounted  electronics 
and  a  remote  head  containing  the  vidicon  tube.  An  antimony 
trisulfate  (Sb2S3)  photocathode  was  used.  The  camera 
electronics  provided  for  control  of  all  the  voltages  applied 
to  the  vidicon  tube  electrodes;  the  performance  of  the 
camera  could  be  tailored  to  a  variety  of  different  operating 
conditions. 

The  video  output  of  the  camera  was  connected  to  a 
Colorado  Video  model  321  video  analyser  unit.  This  device 
produces  a  DC  voltage  output  proportional  to  the  brightness 
of  any  selected  location  on  the  video  image.  The  video 
analyser  also  superimposes  horizontal  and  vertical  cursor 
lines  on  the  displayed  image  to  indicate  the  location  of  the 
sampled  point.  The  unit  was  designed  to  allow  the  sampled 
point  to  be  selected  by  manual  front  panel  adjustments,  but 
modifications  were  cade  to  put  this  selection  under  computer 
control.  Control  was  effected  by  substituting  voltages 
produced  by  digital  to  analcg  converters  (D\C)  in  the 
computer  system  for  front  panel  potentiometers.  Another 
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output  of  the  video  analyser  is  also  of  Interest.  During 
every  vertical  video  scan  each  horizontal  raster  line  is 
sampled  at  the  position  of  the  vertical  cursor  line.  The 
video  analyser  provides  an  output  voltage  signal 
proportional  to  the  brightness  at  the  sampled  points;  this 
signal  changes  at  the  horizontal  scan  rate.  Thus,  the 
signs1  representc  a  vertical  cross-section  of  the  image 
brightness.  The  data  rate  of  this  signal  was  well  matched 
to  the  sampling  rite  of  the  analog  to  digital  conversion 
(ADC)  system  of  the  computer.  Therefore,  this  signal  was 
used  as  the  source  for  all  video  digitizing  and  storage 
operations. 

(  The  video  image  was  displayed  on  a  Conrac  monitor. 

Also,  a  high-resolution  Tektronix  electrostatic  monitor  was 
available.  A  photographic  camera  could  be  attached  to  it  to 
produce  a  permanent  record  of  images. 

A  video  control  unit,  designed  and  built  in  the 
Optics  Lab,  was  employed  to  direct  and  distribute  video 
signals  to  the  selected  locations  in  the  video  system.  A 
further  function  of  this  controller  was  to  monitor  the 
amplitude  of  the  video  signal.  If  any  point  within  the 
image  exceeded  a  preset  intensity  level,  the  controller 
would  cause  an  electrically-controlled  shutter  in  front  of 
the  video  camera  to  close.  This  was  to  guard  against 
accidental  burns  on  the  camera  tube. 

The  Colorado  Video  model  275  digical  video  image 
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memory  was  also  used.  A  custom  board  interfacing  it  to  the 
computer  was  designed  and  built.  Then  data  digitized  and 
stored  by  the  computer  could  be  redisplayed  at  a  later  time 
for  viewing  and  analysis.  One  other  piece  of  electronic 
equipment  used  was  the  Princeton  Applied  Research  model  186A 
synchro-het  lock-in  amplifier.  Its  use  for  phase 
stabilization  will  be  explained  in  Chapter  V. 

Figure  16  is  a  photograph  of  the  electronic  equipment 
used  for  the  experiments.  The  equipment  rack  on  the  right 
contains  the  video  equipment,  in  the  center  rack  are  high 
voltage  power  supplies  and  the  lock-in  amplifier,  in  the 
left  hand  rack  is  the  MicroNova  computer  system.  Visible 
further  to  the  left  are  the  terminal  for  the  MicroNova 
system,  a  printer,  and  a  terminal  for  communication  with  the 
Eclipse  S250  AOS  system.  Not  pictured  are  the  electronic 
shutter  controllers  and  the  stepper  motor  drive  for 
positioning  the  crystal  mount. 

xh£  laaputer  System 

The  computer  system  used  for  experiment  control  and 
data  acquisition  will  be  described  in  two  aspects,  the 
hardware  and  the  software. 

The  hardware  comprising  the  Optics  Lab  MicroNova 
computer  system  is  diagrammed  in  Figure  17.  Use  of  the 
computer  for  experiment  control  and  data  acquisition 
required  coordinated  operation  of  many  input/output 
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facilities.  A  digital  to  analog  converter  board  pro\  .ded 
voltages  to  control  the  position  of  the  video  analyser 
cursors.  An  analog  to  digital  converter  board  was  used  to 
receive  data  signals  frou  Lhe  video  analyser.  A  custom 
designed  board  combining  digital  to  analog  conversion  with 
digital  input  and  output  ports  was  used  to  control  high 
voltage  power  supplies  used  to  nodulate  the  amplitude  and 
phase  of  the  reference  beam.  Another  custom  board  was  used 
to  interface  the  computer  to  the  digital  video  image  memory 
unit.  Video  data  taken  from  experiments  was  stored 
temporarily  on  the  hard  disk  unit.  A  permanent  record  of 
all  data  taken  was  made  on  magnetic  tape.  Data  was 
transferred  to  the  Eclipse  S2S0  AOS  system  for  analysis  with 
a  9600  baud  serial  communications  link. 

To  manage  software  development  in  the  Optics  Lab,  a 
programming  environment  was  established  within  the  Disk 
Operating  System  (DOS)  framework  of  the  MicroNova  computer. 
The  environment  consisted  of  a  number  of  macro  command  files 
designed  to  accomplish  frequently  needed  tasks.  A  number  of 
student  assistants  worked  on  software  for  interface  with  the 
experiment,  and  the  programming  environment  was  useful  for 
introducing  them  to  the  computer  system,  and  for  encouraging 
proper  documentation  procedures.  Software  developed  in  the 
Optics  Lab  falls  into  the  following  categories: 

1)  VIDEO,  DATAXE:  programs  to  control  the  video 
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analyser  unit  for  the  purpose  of  digitizing  and  storing 
experimental  results.  These  programs  allowed  the  user  to 
select  and  display  the  subsections  of  the  video  picture  to 
be  digitized.  The  number  of  scans  over  which  the  video  data 
was  to  be  integrated  and  the  spacing  between  sample  points 
could  be  specified.  The  programs  automatically  included  in 
each  data  file  documenting  information  such  as  time  and  date 
of  acquisition,  and  position  and  extent  of  the  data  on  the 
screen.  Also,  quantitative  calibration  of  the  video 
analyser  signal  was  facilitated  by  this  program. 

2)  VIDMEM :  a  program  to  control  the  digital  video 
image  memory  for  the  purpose  of  redisplaying  previously 
digitized  video  data.  The  user  interface  for  this  program 
was  designed  to  resemble  closely  that  for  the  video  analyser 
program. 

3)  OPAMPPS,  PPSAH:  programs  to  control  high  voltage 
power  supplies  for  the  purpose  of  operating  electrooptic 
phase  and  amplitude  modulators.  The  programs  were  designed 
to  take  into  account  modulator  half-wave  voltages,  laser 
light  wavelength,  experimental  configuration  being  used,  and 
the  sinusoidal  dependence  of  light  amplitude  on  applied 
voltage. 

4)  SEND,  RECEIVE:  programs  to  transmit  and  receive 
data  and  other  files  over  the  serial  communications  link  to 
the  Eclipse  S250. 
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CHAPTER  V 

FEASIBILITY  EXPERIMENTS 

Experiments  were  conducted  to  investigate  the 
practical  operation  of  Exclusive  Or  and  Nand  processing 
systems.  The  experiments  demonstrated  the  principles  of 
both  forms  of  processing,  and  identified  factors  that  affect 
the  probability  of  error  for  processing  operations.  On  the 
basis  of  statistical  data  taken  in  the  Exclusive  Or 
processing  experiments,  a  value  for  the  probability  of  error 
for  the  processing  system  has  been  calculated.  The 

(  procedures  used  for  the  experiments,  and  the  results 

obtained  are  r resented  in  this  chapter. 

Experimental  investigation  o£  Exclusive  Or  Processing 
Demonstration  al  Exclusive  Or  optical  Processing 

To  demonstrate  Exclusive  Or  optical  processing,  the 
following  procedu  c  was  used.  A  data  mask  was 

holographically  recorded  in  the  crystal.  Then  the  data  mask 
was  changed  to  represent  a  different  set  of  inputs.  The 

direct  image  of  this  data  mask  was  superimposed  with  the 
image  of  the  original  data  mask  reconstructed  by  the 
reference  beam.  The  phase  and  amplitude  of  the  reference 
beam  were  adjusted  to  give  the  best  visual  result. 

Data  page  holograms  were  typvcally  recorded  with 
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20  im  of  power  In  the  object  bean,  one  mW  of  power  in  the 
reference  beam,  and  with  an  exposure  duration  of  ten 
seconds.  This  procedure  yielded  holograms  with  a  power 
diffraction  efficiency  of  5xio  3%.  Visually,  the  data  page 
holograms  exhibited  high  contrast,  low  noise,  and  good 
resolution.  Profiles  taken  through  the  reconstructed  image 
of  the  apertures  revealed,  however,  approximately  Gaussian 
shapes  rather  than  the  expected  flat  top  and  square  sides. 

Initially,  a  direct  image  of  the  data  mask  onto  the 
vidicon  tube  produced  uneven  brightness  in  the  apertures. 
Interference  fringes  resulting  from  the  nearly  parallel 
front  and  back  surfaces  of  the  optical  window  on  the  vidicon 
(  tube  were  responsible.  These  fringes  were  eliminated  by 

introducing  an  imaging  lens  between  the  second  Fourier 
transform  lens  and  the  camera,  as  shown  in  Figure  18.  In 
this  way,  wavefronts  reaching  the  camera  were  spherical 
rather  than  plane,  and  produced  fringes  with  a  period 
smaller  than  the  resolution  limit  of  the  video  camera. 

After  recording  the  first  few  holograms,  subsequent 
recordings  became  less  efficient  and  of  poorer  visual 
quality,  though  recording  parameters  were  the  same.  The 
degradation  of  recording  quality  can  be  attributed  to  the 
build-up  of  large  scale  electric  fields  in  the  crystal. 
These  fields  result  from  the  accumulation  of  electric  charge 
at  the  boundary  between  the  illuminated  and  the 
unilluminated  portions  of  the  crystal.  The  electric  fields 
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FIGURE  U.  IMAGING  LENS  ADDED  TO  "EXCLUSIVE  OR"  PROCESSING  CONFIGURATION 
TO  ELIMINATE  INTERFERENCE  FRINGES  ON  VIDEO  CAMERA  FACEPLATE. 
COMPARE  TO  FIGURE  tt. 


5-/0*/ 


64 


were  relaxed,  and  all  holograms  in  the  crystal  erased,  by 
exposing  the  crystal  to  a  source  of  intense  incoherent 
light.  A  mercury  arc  lamp  was  used  to  expose  the  crystal 
for  a  period  of  one  hour.  The  previous  holograms  were 
completely  erased,  and  new  holograms  that  produced  accurate 
reconstructions  could  be  recorded  again  in  the  crystal. 
However,  in  the  course  of  subsequent  experiments,  charge 
build-up  in  the  crystal  recurred.  When  a  second  erasure  of 
the  crystal  became  necessary,  the  use  of  intense 
illumination  did  not  renew  the  crystal  as  completely  as  the 
first  erasure.  Previously  recorded  holograms  were 
eliminated,  but  the  entire  crystal  was  left  with  a  fine 
pattern  of  refractive  index  inhomogeneities,  commonly  known 
as  a  "shower-glass11  pattern.  These  index  of  refraction 
inhomogeneities  scattered  light,  making  it  impossible  to 
record  acceptable  holograms.  A  second  method  known  to  be 
useful  for  erasing  refractive  index  modulations  in  lithium 
niobate  crystals  was  then  employed.  The  crystal  was  heated 
to  190  degrees  Celsius,  in  an  oxygen  atmosphere,  for  a 
period  of  one  hour.  Then,  the  crystal  was  allowed  to  cool 
overnight.  This  treatment,  however,  only  exaggerated  the 
shower-glass  effect.  Another  crystal,  GT-13,  with  the  same 
dimensions  and  doping  as  the  first  crystal,  had  to  be  used. 
Eventually  crystal  GT-13  was  in  need  of  erasure  too.  The 
same  shower-glass  pattern  was  induced  in  it  by  the  erasure 
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procedure.  A  third  crystal,  GT-16,  also  sharing  the  same 
dimensions  and  doping,  was  used  for  the  remaining  Exclusive 
Or  experiments.  These  included  all  experiments  in  which 
probability  of  error  data  were  taken. 

The  first  attempt  to  superimpose  wavefronts  from  the 
object  and  diffracted  reference  beam  revealed  that  the 
reconstructed  image  of  the  mask  was  displaced  horizontally, 
about  one  quarter  of  an  aperture  diameter,  from  the  position 
of  the  direct  image.  The  displacement  occurred  though 

nothing  had  been  moved  on  the  optical  bench  from  the  time 
the  hologram  was  recorded.  Elimination  of  this  displacement 
was  actively  sought  through  realignment  of  the  optical 
(  components,  but  the  cause  was  never  found.  The  magnitude 

and  direction  of  the  displacement  was  reproducible  over  a 
period  of  days,  but  did  vary  slightly  over  the  course  of 
several  weeks.  To  correct  for  the  displacement,  the  data 
mask  was  horizontally  translated  until  its  image  was  brought 
into  alignment  with  the  reconstructed  image.  The  correction 
worked  well  in  a  practical  sense  for  all  of  the  experiments, 
but  was  inconvenient  and  lacked  a  sense  of  precision. 

With  the  direct  and  reconstructed  data  mask  images 
aligned,  and  proper  adjustment  of  the  voltage  applied  to  the 
electrooptic  phase  modulator,  destructive  interference  of 
the  object  and  reference  beam  wavefronts  produced  the 
Exclusive  Or  result  at  all  visible  aperture.*  locations. 
Since  the  pattern  of  input  data  was  the  same  as  :he  pattern 
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recorded  In  the  crystal,  all  bit  locations  were  dark. 
Despite  the  air  suspension  of  the  optical  bench,  however, 
the  result  exhibited  some  small  sensitivity  to  mechanical 
vibrations  produced  in  the  building.  Also,  it  was  necessary 
to  cover  the  experiment  to  shield  it  from  air  currents. 

The  Exclusive  Or  operation  was  produced  using  as 
input  data  the  mask  used  for  recording  the  hologram  shifted 
horizontally  by  one  column.  With  proper  adjustment  of  the 
optical  system,  the  result  exhibited  the  expected  Exclusive 
Or  of  the  dissimilar  data  patterns.  Next,  a  mask  other  than 
the  one  used  for  recording  the  hologram  was  used.  To  verify 
that  a  liquid  gate  arrangement  would  be  required  for  the 
masks,  the  alternate  mask  was  positioned  in  the  object  beam 
without  the  liquid  gate.  Adjustment  of  the  reference  beam 
phase  was  able  to  produce  Exclusive  Or  nulls  within 
localized  regions  of  the  resulting  image,  but  not 
simultaneously  at  all  positions  where  nulls  were  expected  to 
occur.  Phase  variations  resulting  from  small  differences  in 
thicknesses  of  the  glass  plates  supporting  the  masks' 
photographic  emulsions  prevented  uniform  phase  interference 
over  all  apertures.  The  effect  was  not  noticable  when  using 
the  recorded  mask  displaced  by  one  column  as  the  input  data 
mask;  evidently  thickness  variations  over  any  one  mask  are 
vary  gradual.  The  liquid  gate  provides  a  chamber  that 
contains  a  fluid  with  an  index  of  refraction  matching  that 
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of  the  glass  substrate  of  the  data  mask  plates.  when  data 
masks  are  immersed  in  the  fluid,  phase  distortions  of  the 
object  beam  wavefronts  due  to  thickness  variations  in  the 
mask  plates  are  eliminated.  listortions  occur ing  at  the 
interfaces  between  the  liquid  ^hte's  outer  windows  and  the 
surrounding  air  are  fixed  in  amplitude  and  position, 
indepedent  of  the  mask  contained  within  the  liquid  gate. 
Such  fixed  distortions  are  not  detrimental  to  Exclusive  Or 
processor  operation.  The  entire  issue  of  data  mask  phase 
inhomogeneities  is  artificial  to  any  practical  working 
implementation  of  a  processor;  practical  processors  would 
have  a  single  data  mask  device  capable  of  altering  the 
transmissivity  of  its  individual  cells.  As  long  as  each 
cell  exhibited  a  consistent  phase  delay  whenever  the  cell 
was  transparent,  phase  distortion  of  the  beam  would  not 
influence  processor  operation.  With  the  use  of  the  liquid 
gate.  Exclusive  Or  nulls  were  obtained  over  all  visible 
apertures  regardless  of  the  relation  of  the  mask  used  as 
data  input  to  the  mask  used  to  record  the  hologram. 

Detailed  examination  of  the  Exclusive  Or  nulls 
obtained  from  destructive  interference  of  wavefronts  from 
the  imaged  and  reconctructed  beams  revealed  that  they  had  a 
nonuniform  intensity  profile.  Each  had  a  small  bright  point 
at  the  center  of  the  bit  position,  surrounded  by  a  dark 
ring,  surrounded  by  a  dimly  illuminated  ring,  surrounded  by 
the  uniform  darkness  of  the  background.  A  photograph  of 
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data  displaying  this  pattern  is  shown  in  Figure  19.  The 
pattern  is  not  associated  with  the  bright  spots  in  the 
figure,  but  with  the  din  optical  bit  locations.  The 
derivation  presented  in  Figure  20  gives  an  understanding  of 
the  cause  of  this  pattern.  The  envelope  of  the  Fourier 
Transfora  of  the  mask  is  found  to  be  an  Airy  function,  the 
first  zero-crossing  occurs  at  a  radius  of  2.4  mm.  This  is 
considerably  larger  than  the  half-width  half-power  radius  of 
the  reference  beam,  measured  as  1.0  ma.  Therefore,  the' high 
spatial  frequency  components  of  the  Fourier  transform  were 
not  being  recorded  in  the  hologram.  The  attenuation  of  high 
spatial  frequency  components  results  in  a  reconstructed 
image  of  apertures  that  are  less  sharply  peaked  and  broader 
than  apertures  in  the  direct  image  of  data  mask.  Figure  21 
shows  how  destructive  interference  of  the  dissimilar  mask 
images  leads  directly  to  the  observed  characteristic 
intensity  profile. 

The  dissimilarity  of  the  spatial  frequency  content  of 
the  images  is  corrected  by  placing  an  aperture  directly 
behind  the  crystal.  High  spatial  frequencies  are  removed 
from  the  direct  image  of  the  mask  to  the  same  extent  those 
frequencies  are  missing  from  the  reconstructed  image.  The 
correction  is  not  exact  since  the  amplitude  of  the  Fourier 
transform  of  the  recorded  data  pattern  is  still  distorted, 
within  the  limiting  aperture,  by  the  Gaussian  profile  of  the 
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FIGURE  19.  PHOTOGRAPH  SHOWING  DOT  AND  RING  PATTERN 
CHARACTERISTIC  OF  "EXCLUSIVE  OR"  RESULT 
PRODUCED  USING  AN  UNEXPANDED  REFERENCE 
BEAM.  VERTICAL  WHITE  LINES  ARE  CURSORS 
SUPERIMPOSED  BY  THE  VIDEO  ANALYSER. 
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9 MUM  20.  OAAPH'CAL  AMO  ANALYTICAL  PREPAESENTATION  Of  THt  f OAM  Of  THI  fOURIER 
TRANS’ 0AM  Of  THt  OATA  MASK  USED  fOA  TMI  "EXCLUSIVE  OA"  f  A0CIUIN0 
EXPERIMENTS.  <d  ALAN  VIE*  Of  A  AOATION  Of  THt  MASK.  (0ICA0SS  SECTION 
TAK1N  THROUGH  THt  CINTI A  Of  A  AON  0A  COLUMN  Of  API  ATUAIS  Of  THt 
OPTICAL  TAANSMITTAMCE  Of  THI  MASK.  Id  ANALYTIC  Aff  AISINTATfON  Of 
•  TWO-DIMENSIONAL  TAANSMlTTANCE  f  UNCTION  Of  THE  MASK.  If)  EXPAESSION 
fOA  THt  OOTICAL  fOUAlEA  TAANSf  0AM  Of  MASK  TAANSMlTTANCE.  Ill  NUMEAICAL 
VALUES  f  OA  QUANTITIES  Aff  t  AAINC  IN  THE  f  OAMULAE  ASO/E.  HI  CAOSS 
SECTION  THROUGH  fOUAlEA  TAANSf  OAM.  THI  GENERAL  fOA  MO*  THE  TRANSfOMM 
«  UNCHANGED  If  SOME  Of  THt  MASK  APERTURE  LOCATIONS  ARE  OPAQUE. 
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AMPLITUDE  PROFILE 
OF  OIRECT  IMAGE  OF 
APERTURE. 


AMPLITUDE  PROFILE 
OF  BROADENED 
RECONSTRUCTION 
OF  APERTURE  DUE 
TO  LOSS  OF  HIGH 
SPATIAL  FREQUENCIES. 


CHARACTERISTIC  PEAK  AND 
RING  INTENSITY  PROFILE 
RESULTING  FROM  SQUARE 
OF  DIFFERENCE  BETWEEN 
NARROW  AND  BROADENED 
APERTURES. 


FIGURE  21.  EXPLANATION  OF  PATTERN  RESULTING  FROM  "EXCLUSIVE  OR" 
WHEN  SPATIAL  FREQUENCY  CONTENT  OF  RECONSTRUCTED 
APERTURE  IS  LIMITED  BY  SMALL  REFERENCE  BEAM  DIAMETER 
AT  THE  CRYSTAL. 


reference  beam.  However ,  noticable  improvement  in  the 
uniformity  of  the  profile  of  the  nulls  was  produced  by  using 
an  aperture  with  a  diameter  of  2.0  mm. 

Use  of  the  limiting  aperture  was  found  to  have  some 
undesirable  effects  as  well.  First ,  the  match  between 
profiles  of  imaged  and  reconstructed  apertures  improves 
monotonically  with  decreasing  aperture  size.  This  is 
because  smaller  and  smaller  (and  therefore  flatter  and 
flatter)  portions  of  the  center  of  the  reference  beam  are 
allowed  to  pass.  Thusr  a  trade-off  is  introduced  between 
uniformity  of  the  nulls  and  total  optical  power  available  at 
the  detector  plane.  Second,  it  was  physically  impractical 
to  place  the  aperture  flush  against  the  back  surface  of  the 
crystal.  The  closest  practical  placement  was  about  10  mm 
behind  the  crystal.  This  meant  the  aperture  was  not  purely 
a  spatial  frequency  filter.  The  observed  result  was  that 
apertures  near  the  edge  of  the  field  of  view  were  dimmer 
than  those  in  the  center. 

In  view  of  these  difficulties,  a  second  method  of 
matching  the  profiles  of  the  imaged  and  reconstructed 
apertures  was  employed  instead.  By  expanding  the  reference 
beam,  high  spatial  frequencies  were  included  in  the  recorded 
Fourier  transform.  Resolution  of  the  reconstructed 
apertures  was  improved  to  nearly  equal  that  of  the  imaged 
apertures.  The  reference  beam  was  expanded  by  using  two 
lenses,  as  shown  in  Figure  22.  The  focal  lengths  of  the 
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FIGURE  22.  AOOITIONAL  LENS  SYSTEM  FOR  EXPANSION  OF  REFERENCE 
BEAM  DIAMETER.  COMPARE  WITH  FIGURE  11. 
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lenses  used  were  23  bob  and  78  bub,  giving  a  bean  expansion 
ratio  of  about  three.  Not  only  did  the  expanded  bean 
overlap  essentially  all  of  the  Fourier  transforn  of  the  data 
mask  at  the  crystal,  but  the  bean  extended  beyond  the  edges 
of  the  crystal.  The  problem  of  charge  build-up  in  the 
crystal  was  therefore  greatly  reduced.  The  aperture  at  the 
crystal  plane  was  no  longer  used. 

Figures  23,  24,  and  25  present  experimental  Exclusive 
Or  processing  results  that  were  obtained.  These  results 
were  obtained  by  photographing  video  images  on  the 
Tektronics  video  monitor}  the  video  system  is  depicted  in 
Figure  15.  Reproduction  techniques  unavoidably  have 
increased  the  contrast  of  the  photographs}  the  apparent 
variation  in  aperture  diameter  is  in  part  due  to  variations 
in  recorded  intensity  instead.  The  results  presented 
include  use  of  the  liquid  gate  and  the  expanded  reference 
beam.  The  data  Biasks  referred  to  as  A  and  B  in  the  figures 
are  subsections  taken  from  near  the  center  of  a  larger  32  by 
32  aperture  mask.  Magnification  for  the  image  presented  to 
the  video  camera  was  chosen  as  a  compromise  between  viewing 
a  large  number  of  apertures  and  being  able  to  see  the 
details  within  individual  apertures.  For  the  case  presented 
in  Figure  23,  mask  A,  representing  the  input  data,  is  just  a 
horizontally  displaced  version  of  mask  B,  the  mask  used  to 
record  the  reference  pattern  hologram.  However,  comparable 
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FIGURE  24.  DATA  PROCESSING  EXPERIMENTAL  RESULTS  til  "INCLUSIVE  OR" 
OF  DATA  PAGES  SHOWN  IN  FIGURE  23.  lb)  "EXCLUSIVE  OR"  WITH 
REFERENCE  BEAM  POWER  DECREASED  10%  (cl  "EXCLUSIVE  OR" 
RESULTS  WITH  REFERENCE  BEAM  POWER  INCREASED  10%. 
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FIGURE  25.  "EXCLUSIVE  OB”  BESUIT  OF  A  DATA  PAGE  WITH  ITSELF. 

(•I  IMAGE  OF  DATA,  (bl  RECONSTRUCTION  OF  RECORDED 
DATA,  (el  "EXCLUSIVE  OR"  RESULT. 


results  were  obtained  without  this  coincidental 
correspondence  between  masks. 

Figure  23a  shows  the  direct  image  of  the  data  mask  A 
taken  through  the  crystal.  At  the  time  the  photograph  was 
taken,  seven  holograms  had  been  recorded  at  that  same 
location  in  the  crystal.  Distortions  apparent  in  the  shape 
of  the  mask  apertures  are  a  result  of  the  index  of 
refraction  variations  induced  in  the  crystal  by  the  recorded 
holograms.  Figure  23b  shows  the  image  reconstructed  from  a 
hologram  of  data  mask  B.  Figure  23c  presents  the  Exclusive 
Or  result  obtained  with  both  images  presented  together  at 
the  video  camera.  The  relative  phase  and  amplitude  of  the 
beams  were  adjusted  to  give  the  best  visual  Exclusive  Or 
result.  The  result  is  exact  in  that  Figure  23c  does 
represent  the  bit  by  bit  Exclusive  Or  of  data  in  Figures  23a 
and  23b.  It  is  apparent  that  some  optical  bits  in  the 
result  are  brighter  than  others.  Also,  apparent  on  the 
original  photograph,  though  not  on  this  figure,  there  is 
optical  power  present  at  the  locations  of  logical  zero  bits; 
particularly  those  that  result  from  the  Exclusive  Or  of  two 
ones  in  the  input  data  arrays.  The  probability  distribution 
of  the  power  incident  at  output  locations  in  the  result  is 
the  basis  for  calculation  of  a  projected  probability  of 
error.  This  calculation  is  the  subject  of  the  statistical 
experiments  presented  in  the  next  section  of  this  thesis. 
The  result  of  Inclusive  Or  between  masks  A  and  B  is  shown  in 
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figure  24a.  Recall  fron  Chapter  II  that  the  Inclusive  Or 
result  is  produced  by  adjusting  the  optical  system  so  that 
wavefronts  from  the  direct  image  and  from  the  reconstructed 
image  are  in  phase  at  the  detector  plane.  It  is  evident  in 
the  figure  that  binary  ones  in  the  result  produced  by  the 

Inclusive  Or  of  ones  in  both  input  masks  are  much  brighter 

than  ones  present  in  a  single  input  mask.  However,  by 
properly  placing  a  detection  threshold,  the  proper  Inclusive 
Or  result  would  be  obtained.  Figures  24b  and  24c  return  to 
Exclusive  Or  processing  to  examine  the  effect  of  a  10% 

increase  and  decrease,  respectively,  in  the  power  of  the 
reference  beam  used  to  reconstruct  the  hologram  of  data  mask 
B.  The  Exclusive  Or  result  remains  essentially  correct, 
though  in  both  cases  increased  power  in  a  bit  near  the  lower 
right  hand  corner  becomes  apparent.  But,  with  proper 
setting  of  the  detection  threshold,  correctness  of  the 
result  could  be  preserved.  Figure  25  presents  the  result  of 
Exclusive  Or  of  data  mask  B  with  itself.  The  expected 

result  of  all  binary  zero  output  bits  does  occur.  The 
figure  demonstrates  that  even  complete  cancellation  of  data 
page  wavefronts  does  not  give  rise  to  a  diffuse  background 
of  scattered  light  at  the  detector. 

Phase  Stabilization 

In  preparation  for  the  statistical  experiments,  the 
long  term  behavior  of  the  stability  of  the  phase 
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relationship  between  the  object  and  reference  beams  was 
studied.  The  voltage  applied  to  the  phase  modulator  was 
adjusted  to  produce  the  best  Exclusive  Or  result,  and  then 
the  change  of  the  result  over  time  was  observed.  There  was 
a  monotonic  relative  phase  shift  between  the  beams  that 
produced  cycling  of  the  result  from  Exclusive  Or  to 
Inclusive  Or  and  then  back  again.  The  period  of  these 
cycles  was  observed  to  vary  from  one  and  a  half  to  five 
minutes.  The  cause  of  these  cycles  was  first  sought  in 
thermal  expansion  of  some  optical  apparatus,  caused  by 
heating  from  the  laser  beam.  A  second  possible  explanation 
was  a  dynamic  recording  effect  taking  place  in  the  crystal. 
Both  hypotheses  were  eliminated  from  consideration  in  the 
following  way:  The  relative  phase  of  the  beams  was  set  to 
produce  a  good  Exclusive  Or  result,  then  the  laser  beam  was 
blocked  as  it  exited  from  the  laser.  The  Exclusive  Or 
result  was  observed  at  intervals  of  time  afterwards  by 
briefly  unblocking  the  beam.  The  rate  of  the  phase  drift 
measured  in  this  situation  was  unchanged  from  the  rate 
observed  with  a  continous  beam.  This  result  focused 
attention  on  the  laser  itself  as  the  source  of  the  phase 
drift.  There  was  a  difference  of  about  70  mm  between  the 
path  lengths  of  the  object  and  reference  beams,  from  the 
point  where  they  separated  at  the  beam  splitter  to  their 
intersection  at  the  crystal.  A  change  in  optical  frequency 
of  the  laser  would  appear  as  a  relative  phase  shift  between 


the  beams  due  to  the  path  length  difference.  Careful 
measurement  of  the  frequency  stabiltity  of  the  laser  was 
made  using  a  scanning  spectrum  analyser.  After  an  initial 
warm-up  period,  the  frequency  of  the  laser  did  change  at  a 
rate  as  great  as  95  MHz/minute.  This  was  translated  into 
the  equivalent  phaseshift  using  the  formula: 

A*  -  -2~  Av.  (3) 

where: 

At  is  the  change  in  phase  in  radians, 
d  is  the  path  length  difference  of  the  two  beams, 
c  is  the  speed  of  light, 

Av  is  the  change  in  beam  oscillation  frequency. 

The  calculated  phaseshift  rate  of  8.64  degrees/min  was  more 
than  an  order  of  magnitude  below  the  observed  rate  of  240 
degrees/minute.  Therefore,  to  confirm  this  finding,  the 
beam  path  lengths  were  adjusted  to  be  as  nearly  equal  as 
possible.  The  observed  phase  shift  continued  at  the  same 
rate.  Then  the  beam  path  that  initially  had  been  the 
shorter  was  made  very  much  longer  than  the  other  beam  path. 
The  phase  shift  was  unchanged  in  direction  or  magnitude. 
These  observations  eliminated  laser  frequency  drift  as  a 
possible  source  of  the  phase  shift. 

The  only  remaining  possible  cause  was  a  general 
thermal  expansion  and  contraction  of  the  optics  bench  and 
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the  optical  components.  The  air  conditioning  system  of  the 
building  made  it  impossible  to  maintain  a  constant  room 
temperature  to  within  better  than  plus  or  minus  three 
degrees  Celsius.  Also,  even  if  possible,  such  tight  control 
of  the  temperature  would  not  be  desirable  for  future 
experiments.  However,  some  way  of  controlling  the  relative 
phase  of  the  beams  was  needed.  The  experimental  apparatus 
available  to  digitize  video  data  was  expected  to  take  more 
than  1.5  minutes  to  store  an  entire  image.  Obviously  the 
current  rate  of  phase  change  would  produce  results  that 
represented  no  particular  phase  relationship. 

A  phase  stabilization  feedback  system  was  devised  to 
make  consistent  measurements  possible.  A  block  diagram  of 
the  system  is  shown  in  Figure  26.  The  heart  of  the 
stabilization  system  was  a  synchronous  lock-in  amplifier. 
The  reference  oscillator  in  the  lock-in  amplifier  provided  a 
10  Ez  sinusoidal  voltage  signal.  This  signal  was  summed  by 
an  op  amp  circuit  as  a  dither  signal  to  the  DC  voltage  that 
controlled  the  reference  beam  phase  modulator.  The  combined 
signal  was  amplified  by  a  high-voltage  op  amp  power  supply 
and  applied  to  the  phase  modulator.  It  is  the  property  of 
the  Exclusive  Or  operation  that  when  the  phase  of  the 
reference  and  object  beams  differ  by  exactly  180  degrees, 
the  total  optical  power  incident  on  the  detector  plane  will 
be  a  minimum.  A  low  pass  filter  at  the  input  of  the  lock-in 
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amplifier  was  used  to  integrate  spatially  over  the  entire 
image  of  the  Exclusive  Or  result  because  video  frames  are 
repeated  30  times  per  second,  any  component  of  the  video 
signal  occurring  at  10  Hz  must  represent  variations  in  the 
spatially  integrated  brightness  of  the  entire  frame.  This 
is  the  reason  10  Bz  was  chosen  as  the  reference  oscillator 
frequency.  The  synchronous  amplifier  portion  of  the  lock-in 
amplifier  produces  an  output  given  by: 

V(t)  ■  A  cos(ufT  +  6r>  exp[-8(t  -  t))cIt,  (4) 

where: 

V(c)  Is  che  output  voltage  of  the  lock-in  amplifier, 

V  (t)  is  the  Input  voltage  of  the  lock-in  amplifier, 
is  the  radian  frequency  of  the  reference  oscillator, 

A;  B,  are  adjustable  constants. 

That  is,  the  output  is  the  time  average  of  the  amplitude  of 
the  component  of  the  input  signal  that  varies  in  phase  with 
the  output  of  the  reference  oscillator.  This  provides  a 
very  sensitive  way  of  detecting  a  small  signal  in  the 
presence  of  noise.  Figure  27a  shows  the  variation  of  the 
average  video  voltage  signal  produced  by  the  sinusoidal 
phase  modulation  when  the  average  phase  difference  between 
the  object  and  reference  beams  is  far  from  180  degrees. 
Synchronous  '  detection  of  the  video  signal  would  produce  a 
negative  DC  voltage  at  the  output  of  the  syw  hronous 


amplifier.  The  time  integral  of  this  phase  error  signal  is 
sunned  to  the  voltage  controlling  the  phase  modulator. 
Figures  27b  and  27c  show  how  the  detected  power  signal  would 
change  as  the  phases  o£  the  reference  and  object  beans 
approached  tha  correct  relationship.  If  the  phase  should 
drift  away  from  the  optimal  setting  in  the  opposite 
direction*  a  positive  error  signal  voltage  would  result* 
again  returning  the  average  phase  to  the  optimal  setting. 
Figure  28  shows  the  circuit  built  to  perform  the  time 
integration  of  the  phase  error  signal  produced  by  the 
lock-in  amplifier  and  the  weighted  summation  of  the  signals 
controlling  the  phase  modulator. 

In  operation,  the  amplitude  c.f  the  dither  signal  was 
set  to  produce  a  ±15  degree  phase  variation  in  the  reference 
beam.  The  effect  of  this  on  the  Exclusive  Or  result  was 
barely  discernable  in  the  image  displayed  on  the  video 
monitor  screen.  The  DC  voltage  was  adjusted  to  give  the 
best  visual  Exclusive  Or  result.  The  feedback  gain  was 
increased  until  oscillations  in  the  feedback  voltage  were 
observed*  then  the  gain  was  set  just  below  that  point.  The 
phase  stabilization  system  did  prove  capable  of  tracking  and 
correcting  for  the  phase  drift  observed  in  the  Exclusive  Or 
results.  It  was  designed,  however*  to  follow  the  long  term 
shifts  and  not  the  short  high  frequency  disturbances  that 
can  come  from  room  vibrations  and  air  currents.  Therefore, 
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under  even  fche  best  conditions,  brief  excursions  front  a  good 
Exclusive  Or  result  were  still  observed, 
statistical  Reliability  Experiments.. 

In  order  to  show  that  Exclusive  Or  processing  could 
be  performed  repeatably,  a  series  of  experiments  was 
undertaken  to  measure  data  that  would  allow  calculation  of  a 
probability  of  error,  also  known  as  the  bit  error  rate,  for 
the  processing  operation.  All  experiments  that  provided 
statistical  data  for  Exclusive  Or  processing  proceeded  in  a 
similar  manner.  First,  a  hologram  of  one  of  the  two 
available  data  masks  was  recorded  in  the  crystal.  This 
recording  was  done  at  the  same  position  in  the  crystal  for 
all  experiments,  but  the  crystal  was  rotated  to  a  different 
angle  with  respect  to  the  beams  for  each  recording. 
Recording  was  done  with  40  of  optical  power  in  the  object 
beam,  and  4.0  mw  of  optical  power  in  the  reference  beam. 
The  exposure  time  was  30  seconds.  When  the  recording  of  a 
hologram  was  complete,  the  object  beam  mask  was  altered. 
This  was  done  either  by  translating  the  mask  used  for 
recording  by  one  column,  or  by  replacing  it  with  the  other 
available  mask.  Then  the  reference  beam  was  turned  on  and 
the  reconstructed  image  of  the  first  mask  was  observed  with 
the  video  system.  The  power  in  the  reference  beam  was 
adjusted  until  the  intensity  at  the  center  of  the  apertures, 
as  measured  by  the  video  analyser,  was  one  fourth  of  the 
maximum  brightness  to  which  the  video  system  can  respond. 
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The  position  of  the  center  of  an  aperture  in  the 
reconstructed  image  was  narked  with  the  video  analyser 
cursors.  The  reference  bean  was  turned  off  and  the  object 
bean  was  turned  on.  The  inage  of  the  data  mask  was  aligned 
with  the  reconstructed  image  of  the  first  mask  by  using  the 
position  marked  with  the  video  analyser  cursors.  Then  the 
power  in  the  object  beam  was  adjusted  to  produce  the  same 
brightness  at  the  center  of  the  apertures  as  was  measured 
for  the  reconstructed  image.  With  both  beams  turned  on,  the 
DC  voltage  applied  to  the  reference  beam  phase  modulator  was 
adjusted  to  give  an  Exclusive  Or  result  that  appeared  to  be 
the  best.  The  switch  applying  the  dither  voltage  in  the 
phase  stabilization  system  was  then  closed.  The  phase 
stabilization  system  would  lock  in  the  proper  phase 
relationship  between  the  object  and  reference  beams.  The 
appearance  of  the  Exclusive  Or  result  was  fine  tuned  by 
adjusting  the  voltage  applied  to  the  reference  beam 
amplitude  modulator.  This  fine  tuning  was  to  produce  the 
greatest  observed  ratio  between  the  brightness  of  the  ones 
in  the  result  due  to  the  reference  beam  to  the  brightness  of 
the  zeros  due  to  destructive  interference  of  both  images. 
When  adjustments  were  completed,  the  command  to  digitize  the 
video  signal  was  issued  to  the  VIDEO  computer  program. 

The  raw  data  were  12  bit  positive  magnitude  numbers 
Each  number  represented  the  digitized  value  of  the  video 
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signal  voltage  related  to  the  intensity  of  light  incident  on 
one  position  of  the  vidicon  photocathode.  The  operating 
principles  of  the  video  analyser  dictated  that  sequential 
data  points  represent  vertically  adjacent  positions,  taken 
from  the  top  of  the  screen  to  the  bottom.  When  one  vertical 
scan  was  completed,  the  horizontal  position  of  the  scan  was 
adjusted  by  the  computer  and  the  next  scan  was  taken.  The 
video  analyser  was  calibrated  so  that,  in  terms  of  distance 
on  the  video  photocathode,  the  separation  of  neighboring 
sample  points  within  a  vertical  scan  was  equal  to  the 
separation  between  neighboring  scans.  A  rectangular  portion 
of  the  video  image  was  scanned.  The  scanned  area  was 
sampled  272  times  along  the  vertical  dimension  and  362  times 
along  the  horizontal  dimension.  Depending  on  the 
registration  of  the  sampled  area  with  the  data  mask  image, 
an  array  was  sampled  that  included  seven  or  eight  rows  and 
nine  or  10  columns  of  optical  bit  locations.  The  time 
required  to  scan  the  entire  rectangular  area  was 
approximately  one  and  one-half  minutes.  The  monotonic  drift 
of  the  relative  phase  of  the  object  and  reference  beams 
during  this  period  was  compensated  by  the  phase 
stabilization  system.  However,  occasional  brief  excursions 
from  a  good  Exclusive  Or  result  were  observed,  and  data 
representing  these  excursions  was  unavoidably  recorded. 
Although  precautions  were  taken  to  isolate  the  experiment 
from  building  vibrations  and  air  currents,  remnants  of  these 


disturbing  influences  are  the  suspected  source  of  the 
etc  rsions.  The  data  were  temporarily  stored  as  disk  files 
on  t.*e  MicroNova  compter  system,  then  transferred  over  a 
serial  communication  line  to  the  Eclipse  S250  AOS  computer 
system  for  analysis. 

In  a  practical  processing  Bystem,  the  vidicon 
photocathode  would  be  replaced  by  an  array  of 
photodetectors,  one  detector  at  the  position  of  each  optical 
output  bit.  Detectors  based  on  one  of  a  number  of 
principles  would  serve  the  purpose:  charge-coupled  devices, 
charge-injection  devices,  photoconductive  devices,  or 
photodiode  devices.  Any  form  of  detector  that  is  used, 
however,  will  produce  an  output  voltage,  or  equivalently  an 
output  current,  that  is  proportional  to  the  two  dimensional 
spatial  integral  of  the  optical  power  incident  on  the 
detector.  The  optical  power  falling  on  any  element  in  the 
detector  array  of  the  processing  system  will  in  principle 
assume  one  of  two  values.  Zero  optical  power  would 
correspond  to  a  binary  zero  state;  a  fixed,  nonzero,  value 
of  optical  power  would  correspond  to  a  binary  one  state.  In 
reality,  of  course,  the  power  falling  on  detector  elements 
assumes  a  distribution  of  values.  Among  the  factors 
contributing  to  randomization  of  the  detected  power  values 
are  imperfections  in  the  optical  elements,  disturbances  in 
the  air  through  which  the  light  travels,  vibration  of  the 
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optical  components,  and  imperfect  holographic  recording  and 
reconstruction  of  wavefronts  due  to  properties  of  the 
crystal  or  properties  of  the  recording  beams.  To  obtain  a 
binary  output  from  a  distribution  of  power  values,  a 
threshold  value  of  power  must  be  selected.  Elements  - 
detecting  a  level  of  power  exceeding  the  threshold  are 
considered  to  produce  a  binary  output  of  one:  elements 
detecting  a  level  of  power  less  than  the  threshold  produce  a 
binary  xero.  From  the  nature  of  the  distribution  of 
detected  power  levels,  some  elements  that  should,  on  the 
basis  of  the  input  data  to  the  logic  function,  produce  an 
output  of  zero  will,  in  fact,  produce  an  output  of  one.  The 
probability  of  this  type  of  error  occuring  is  called  the 
probability  of  false  alarm,  ?  ,  and  is  given  by: 

a* 

m 

W  s  po  J  Vu,0)  du*  <5 


where: 

P_  la  che  probability  of  falaa  alarm, 

VI  la  Che  threshold  value  of  power, 

K.  la  the  probability  of  a  binary  zero  output, 

Pq(u|0)  la  the  conditional  probability  density  function  of  the 
0  optical  power  given  an  output  binary  aero. 


Conversely,  some  detectors  that  should  produce  an  output  of 
one  will  produce  an  output  of  zero.  The  probability  of  this 
type  of  error  occurring  is  called  the  probability  of  miss, 
F^,  and  is  given  by: 


93 


(  . 


( 


Ws*i  j, 


/  T  p^ull)  du, 


(6) 


where: 

I*  Is  the  probability  of  a  atlas, 

V?  is  the  threshold  value  of  the  power, 

is  the  probability  of  a  binary  one  output, 
p*(u|l)  la  the  conditional  probability  density  function  of  the 
optical  power  gives  an  output  binary  osa. 


The  total  probability  that  a  detector  will  produce  an  output 
that  does,  not  correctly  represent  the  result  of  the 
Exclusive  Or  operation  is: 

W  “  W  +  VV’  (7) 

where: 

la  the  total  probability  of  error. 

To  calculate  the  total  probability  of  error,  P£, 
implied  by  the  video  data  taken  in  the  Exclusive  Or 

processing  experiments,  probability  density  functions  were 
chosen  that  represent  the  distribution  of  detected  powers, 
and  a  threshold  value  of  power  was  selected.  Previous 
analyses  of  detection  of  binary  signals  in  the  presence  of 
noise  {41,42]  have  dealt  with  radio  communication.  A 
similar  treatment  will  be  used  for  the  case  of  detection  of 
binary  optical  signals.  The  situation  for  detection  of 

results  of  Exclusive  Or  optical  processing  is  more  complex, 
however.  There  are  four  states  the  two  binary  inputs  to  an 

_ _ _ &stu 
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Exclusive  Or  operation  can  have.  Reasonably  then,  four 
probability  density  funcions  will  be  necessary  to  describe 
the  distribution  of  detected  powers.  An  important  factor  to 
consider  in  the  choice  of  distribution  functions  is  the 
electronic  video  noise  that  is  combined  with  the  measured 
power  values.  The  previous  analyses  for  binary  radio 
communication  have  used  a  Rayleigh  distribution  to  represent 
the  power  detected  when  a  binary  zero  is  transmitted. 
However,  video  noise  was  found  to  overwhelm  completely  any 
detected  power  at  detector  locations  corresponding  to  two 
binary  zero  inputs  to  the  Exclusive  Or  operation.  Therefore 
^  a  Gaussian  distribution  function  was  chosen  to  represent 

these  locations: 


po(u,00)  "  \i/2 —  -  uoo)2/2°oo1,  (8> 

(2lr)  °oo 

where: 

Pq(u|00)  Is  the  conditional  probability  density  function  given 
that  both  Input  bits  are  zero, 

0OQ  le  the  standard  deviation  of  the  distribution, 

Uqq  Is  the  nean  of  the  distribution. 


The  situation  for  the  other  three  distributions  is  analogous 
to  reception  of  transmitted  signal  power  in  the  presence  of 
noise  and  fading.  In  the  case  of  the  Pu  distribution,  the 
detected  power  results  from  imperfect  destructive 
interference  of  the  imaged  and  diffracted  wavefronts.  For 


the  Pol  and  Pio  distributions,  the  signal  power  is  from 
Exclusive  Or  output  bits  with  a  value  of  binary  one. 
Therefore,  the  Rician  distribution  (43]  was  used  to  model 
P01»  *10 »  an<*  *u « 


P0(u|ll)  -  -f- exp[-<u2  +  aJp/Eo^]  I0(uun/o21) 


*l(u|01)  -  -f-  exP[-(u2  +  uJ^/20^]  l0C»«01/a5l)' 


PjtullO)  -  -f-  exp(-(u2  +  uJ0)/2oJq]  I0Cuu10/oJ0). 

°10 

where: 

P0(u|ll),  p^(u|01) ,  and  p^ullO)  are  the  conditional  probability 
density  functions  given  binary  Inputs  11,  01,  and  10  respectively. 

Ig  is  the  modified  Bessel  function  of  the  first  kind  and  zero  order. 


To  obtain  the  total  power  distributions  associated  with 
detection  of  a  binary  one  or  a  binary  zero  the  four  density 
funtions  will  be  summed  in  pairs  appropriately: 


pQ(u|0)  •  Pq(u|00)  +  p0(ulll>, 


p.(u|l)  -  p.(u|01)  +  p  (u|10). 


The  f list  step  in  fitting  the  chosen  distribution 
functions  to  the  neasured  data  was  to  relate  the  video 
signal  voltage  levels  to  incident  power.  Only  relative 
power  levels  are  important,  since  scaling  all  measured 
powers  by  the  same  factor  does  not  affect  the  probability  of 
error.  The  relation  of  video  signal  voltage  to  incident 
power  is: 

Vy  -  I*  (11) 

where: 

V  is  che  video  signal  voltage, 
lv  is  the  incident  optical  intensity, 

Y°la  a  characteristic  of  the  photocathode. 

The  Sb2S3  vidicon  photocathode  has  a  y  of  0.65.  Therefore, 
a  program  was  written  to  correct  all  measured  voltage 
samples  according  to  the  formula: 

Vc  -  v<1/lr),  (12) 

where: 

is  the  voltage  corrected  to  represent  the  optical  power. 

This  program  also  converted  the  12-bit  integer  format  of  the 
raw  data  into  standard  Data  General  floating  point  number 
format. 


The  next  step  was  to  nodel  discrete  detectors  from 
the  array  of  video  sample  values.  Square  detector  elements 
were  modeled  as  centered  on  the  positions  of  the  optical 
output  bits.  The  detectors  could  be  modeled  so  that  the 
length  of  each  detector  element  equaled  the  separation  of 
neighboring  optical  bit  positions.  Alternatively,  the  edge 
length  could  be  smaller,  leaving  "dead  space"  between 

m 

detector  elements.  Optical  power  for  each  output  bit  in  the 
sampled  results  was  observed  to  be  confined  to  a  region  near 
the  central  position  for  the  bit.  Therefore,  the  detectors 
could  be  made  small  and  still  intercept  all  incident  optical 
power.  A  benefit  of  using  small  detector  areas  was 
improvement  of  the  signal  to  noise  ratio  of  the  measured 
values.  Electrical  noise  from  the  video  equipment 
constituted  a  substantial  part  of  the  sampled  signal.  By 
using  the  minimum  number  of  sample  points  in  a  detector 
element,  the  contribution  of  noire  was  decreased  without 
affecting  contributions  representing  incident  optical  power. 
Calculations  of  probability  of  error  were  done  using  a  range 
of  modeled  deteccor  sizes.  The  detector  size  that  most 
frequently  gave  the  lowest  probability  of  error  was  eight 
samples  square.  Using  34  samples  as  the  measured  center  to 
center  spacing  between  optical  bits,  and  recalling  that  the 
diameter  of  input  mask  apertures  is  one  fourth  the  center  to 
center  spacing,  the  edge  length  of  the  detector  giving  the 
lowest  probability  of  error  is  calculated  to  be  1.06  times 


the  diaaeter  of  the  iaage  of  the  mask  aperture 


Geoaetric  distortion  in  the  raster  scan  pattern  of 
the  video  caaera  caused  the  optical  output  bit  locations  to 
fall  in  an  array  that  was  not  precisely  orthogonal.  with 
the  columns  of  optical  bits  aligned  vertically*  rows  of  bits 
aade  an  angle  of  0.9  degrees  with  horizontal*  rising  to  the 
right.  Using  the  VIDMEM  program  to  control  the  digital 
video  memory  connected  to  the  Micronova  computer  system,  the 
vertical  and  horizontal  location  of  the  centers  of  the 
optical  bits  at  the  four  corners  of  the  sampled  array  were 
determined.  These  locations  served  as  the  input  to  the 
progaa  running  on  the  Eclipse  S250  system  that  calculated 
and  corrected  for  the  distortion.  The  calculated  positions 
cf  the  detector  locations  were  checked  by  superimposing  them 
cn  the  data  displayed  by  the  VIDMEM  program. 

Square  detector  areas  of  the  selected  edge  length 
were  defined  at  positions  centered  on  all  optical  bit 
locations  in  the  sampled  array.  All  sample  values  falling 
within  each  detector  area  were  summed  to  give  a  composite 
measurement  of  optical  power  incident  on  the  detector. 
Included  in  all  sample  values  was  a  video  pedestal  level. 
This  is  a  DC  voltage  level  that  varied  with  position  on  the 
vidicon  photocathode  and  also  changed  slowly  over  time.  To 
remove  the  effect  of  this  offset  from  the  data*  a  file 
containing  sampled  values  of  the  video  signal  with  no  light 
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incident  on  the  vidicon  photocathode  was  always  taken  during 
each  experimental  session.  Sample  values  stored  in  this 
file  also  were  summed  over  the  modeled  detector  areas.  For 
each  detector,  the  sum  from  the  background  file  data  samples 
was  subtracted  from  the  sum  of  the  optical  processing  data 
file  samples  to  give  a  final  measure  of  the  optical  power 
incident  on  each  detector. 

From  records  of  the  binary  input  data  masks  for  the 

Exclusive  Or  operation,  detector  powers  were  classified  into 

four  groups,  corresponding  to  the  four  different  possible 

states  of  the  input  bits.  The  number  of  values  falling  into 

each  classification  were  approximately  equal.  Por  each 

class,  the  average  power  and  the  standard  deviation  were 

calculated.  For  the  PQ0  distribution,  u^  in  Equation  8 

was  set  equal  to  the  calculated  average  for  the 

corresponding  class,  oQ0  was  set  equal  to  the  standard 

deviation.  For  the  other  three  distributions,  u  in 

xx 

Equation  9  was  set  equal  to  the  calculated  mean,  and  «xx 
was  set  equal  to  the  standard  deviation.  Sample  plots  of 
the  probability  distribution  functions  obtained  this  way  are 
given  in  Figure  29. 

There  are  several  criteria  upon  which  to  select  a 
threshold  value  of  power.  Since  an  output  bit  falls  into 
any  of  the  four  classes  with  equal  probability,  the 
criterion  that  results  in  the  lowest  probability  of  error 
states  that  the  threshold  is  chosen  at  the  value  of  power 
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where  pQ  equals  .  A  program  using  the  procedure  of 
interval  bisection  was  written  to  find  this  threshold  power 
value.  The  total  probability  of  error  was  then  calculated 
according  to  Equation  7.  Values  for  the  required  integrals 
of  the  Rician  distribution  have  been  tabulated  (44]. 
However,  it  was  more  convenient  to  evaluate  them 
numerically.  The  numerical  integration  program  produced 
values  for  PM,  Pf,  and  ^  . 

A  total  of  10  data  files  containing  584  optical  bit 
positions  were  used  in  the  determination  of  the  probability 
of  error  measurement  for  the  Exclusive  Or  processing  system. 
Including  background  level  data  files,  a  total  of  1,378,496 
sample  values  were  collected  and  processed.  Each  data  file 
was  processed  for  an  average  of  four  different  detector 
sizes. 

As  stated  previously,  some  data  were  collected  that 
represented  excursions  from  the  nominal  Exclusive  Or  result. 
These  are  considered  to  be  artifacts  of  the  long  period  of 
time  required  to  collect  the  data  in  each  Exclusive  Or 
result  image.  As  such,  they  are  not  representative  of  the 
true  operating  characteristics  of  the  processing  system,  and 
so  were  discarded.  To  avoid  unduly  biasing  the  data,  the 
following  precaution  was  observed.  Data  for  the  output 
positions  in  each  column  of  optical  bits  are  taken  within 
the  same  span  of  time.  Therefore,  all  positions  in  a 


column  represent  the  same  relative  phase  o£  the  object  and 
reference  beams*  So*  data  were  rejected  or  accepted  on  a 
column  by  column  basis. 

Using  the  statistical  procedure  described  above*  the 

values  of  PM»1.49xl0"3*  Pr-9. 10x10 ”4*  and  PE«2.40xlQ~3  were 

obtained  for  the  Exclusive  Or  processing  system.  The 

probability  density  distributions  shown  in  Figure  29  are  the 

functions  used  to  arrive  at  these  values.  These  values  are 

quite  high  as  compared  to  typical  figures  of  1x10 

probability  of  error  for  electronic  digital  processing 

systems.  The  values  obtained  may  be  taken  as  preliminary 
• 

Indications  only*  All  technologies  applied  to  digital 
processing  in  the  past  have  shown  steady  Improvement  of 
processing  reliability  as  knowledge  was  gained  about  the 
factors  Influencing  the  systems'  probability  of  error. 
There  is  no  reason  to  believe  Exclusive  Or  optical 
processing  will  not  also  become  more  reliable  as  it  is  more 
fully  understood. 

One  important  factor  to  consider  in  the  determination 
of  the  probability  of  error  is  the  effect  of  video  noise  on 
the  distributions  of  measured  powers.  The  uncertainty 
contributed  to  the  measurements  by  video  noise  was  evaluated 
in  the  following  way.  Two  background  data  files  taken  on 
the  same  day  were  used  to  provide  data  to  the  program  for 
integrating  measured  optical  power  over  the  modeled  detector 
areas.  One  file  served  in  the  usual  role  for  a  background 


data  file  while  the  other  substituted  for  a  data  file 
containing  optical  output  bit  power  measurements.  The 
standard  deviation  for  integrated  detector  powers  produced 
in  this  way  was  calculated.  In  general,  this  standard 
deviation  was  approximately  half  of  the  standard  deviation 
observed  in  measurements  of  standard  data  files.  If  it  is 
assumed  that  the  video  noise  is  a  random  variable  added  to 
the  random  distribution  of  detected  optical  powers,  then  the 
probability  density  functions  of  the  error  measurements  are 
given  by  the  convolution  of  the  video  noise  probability 
density  functions  with  the  true  optical  power  probability 
density  functions.  Further  assuming  that  the  behavior  of 
Rician  distributions  upon  convolution  is  similar  to  the 
behavior  of  Gaussian  distributions,  the  previously  measured 
standard  deviations  represent  the  sum  of  the  standard 
deviations  for  the  video  noise  distribution  and  the  true 
optical  power  distributions.  Using  new  values  for  the 
standard  deviations  of  the  probability  function  for  tne  four 
classifications  of  Exclusive  Or  processing  output  bits,  the 
probability  of  error  was  recalculated.  The  values  obtained 
were:  P  ■8.03xl(T8  ,  P_»4.72xi0"4  ,  and  E, ■4.72xl0”4  .  These 
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values  show  that  video  noise  was  an  important  influence  in 
determining  the  probability  of  error  for  Exclusive  Or 
processing. 

The  experiments  demonstrating  the  principle  of 


Exclusive  Or  processing,  and  the  experiments  that  provided 
data  for  determination  of  the  probability  of  error  provided 
many  insights  into  the  factors  involved  in  the  operation  of 
the  processing  system.  A  displacement  parallel  to  the  plane 
of  the  recording  beams  of  the  reconstructed  data  mask  image 
with  respect  to  the  direct  data  mask  image  was  observed. 
The  importance  of  having  equal  spatial  frequency  content  in 
the  reconstructed  and  direct  data  page  images  was 
discovered.  Two  methods  of  achieving  good  agreement  in  the 
images  at  the  detector  plane  were  used:  placement  of  an 
aperture  at  the  crystal  plane,  and  use  of  an  expanded 
reference  beam..  The  latter  method  was  found  to  be  superior, 
since  it  allowed  both  images  to  have  the  best  possible 
resolution.  An  optical  and  electronic  feedback  system  for 
stabilizing  the  relative  phase  of  the  object  and  reference 
beams  during  data  processing  operations  was  implemented. 
The  system  proved  effective  for  compensating  gradual  drift 
in  the  relative  phase  of  the  beams.  The  performance  of  the 
stabilization  system  might  be  considerably  improved  by 
directly  monitoring  the  light  at  the  output  of  the 
processing  system,  rather  than  monitoring  the  video  signal. 
The  Exclusive  Or  operation  was  repeatably  obtained,  and  a 
preliminary  measurement  was  made  of  the  probability  of 
processing  error. 


The  Wand  Optical  Processing 


The  operating  principles  of  the  Hand  form  of  optical 
processing  system  were  demonstrated  by  the  experiments 
described  in  this  section  of  the  thesis.  A  detailed  account 
is  given  of  factors  that  must  be  considered  when 
constructing  a  Nand  processing  system. 

There  is  an  important  practical  difference  between 
the  operation  of  the  Exclusive  Or  processing  system  and  the 
operation  of  the  Nand  processing  system.  For  data 
processing  with  the  Exclusive  Or  processor .  light  from  the 
reference  beam  is  diffracted  by  a  recorded  hologram  to 
contribute  to  the  output  object  beam  wavefront.  For  data 
processing  with  the  Hand  Processor,  light  from  the  object 
beam  is  diffracted  by  the  recorded  holograms  to  reconstruct, 
the  reference  beam  wavefront.  The  holograms  recorded  in  the 
experiments  were  of  lo*»  diffraction  efficiency,  on  the  order 
of  one  percent.  Low  efficiency  holograms  would  have  to  be 
used  in  a  practical  processing  system  where  many  holograms 
are  recorded  at  the  same  location  in  the  crystal. 
Therefore,  to  obtain  detectable  power  in  the  reconstructed 
reference  beams,  considerable  power  must  be  available  in  the 
object  beam  during  the  data  processing  operation.  A 
preliminary  test  of  the  ability  of  the  optical  6y8tem  to 
diffract  measurable  power  from  the  object  beam  into  the 
reference  beam  was  conducted.  Osing  an  optical  arrangement 


similar  to  that  used  for  the  Exclusive  Or  processing 
experiments,  the  hologram  of  a  data  page  of  a  32  by  32  array 
of  transparent  apertures  was  recorded.  Then,  with  the 
reference  beam  blocked,  light  diffracted  from  the  object 
beam  into  the  reference  beam  path  was  measured.  Using  all 
the  power  the  optical  system  could  provide  in  the  object 
beam,  the  power  detected  in  the  reference  beam  barely 
registered  on  the  power  meter,  at  a  level  of  a  few 
nanowatts. 

On  the  basis  of  this  result,  the  original  plan  to 
use  selected  apertures  from  the  mask  used  in  the  Exclusive 
Or  experiments  as  the  object  beam  mask  in  the  Nand 
experiments  was  modified.  An  aluminum  plate,  having  a 
single  row  of  eight  3.2  mm  diameter  apertures  with  a 
center-to-center  spacing  of  4.8  mm,  was  available.  Using 
this  plate  as  the  object  beam  mask  had  several  advantages. 
The  total  area  of  apertures  in  the  plate  was  greater  than 
the  area  of  the  apertures  in  the  mask  for  the  Exclusive  Or 
experiments.  For  a  given  intensity  incident  on  the  mask, 
more  light  passed  through  the  aluminum  plate  mask.  Also, 
the  data  pattern  in  the  new  mask  could  be  altered  by 
covering  selected  apertures  with  opaque  tape.  Within  the 
limitation  of  eight  apertures,  there  was  no  restriction  on 
the  data  patterns  to  be  used. 

A  hologram  of  the  new  mask  was  recorded.  This  time, 
with  the  reference  beam  blocked,  a  power  of  several 


microwatts  was  diffracted  from  the  object  beam  into  the 
reference  beam  path.  This  level  of  power  was  quite 
satisfactory  for  use  in  the  Mud  processing  experiments. 

Another  difference  between  Exclusive  Or  processing 
and  Nand  processing  is  that  the  critical  phase  relationships 
needed  for  Nand  processing  are  recorded  in  the  holograms. 
The  opportunity  to  control  the  phase  of  beams  during  the 
data  processing  operation  does  not  exist  as  it  does  with  the 
Exclusive  Or  processor.  Therefore,  the  relative  phase  of 
the  object  and  reference  beams  had  to  be  adjusted  and 
monitored  while  recording  holograms  for  Nand  processing. 
The  optical  components  of  the  monitoring  system  are  shown  in 
Figure  30.  Light  from  one  of  the  eight  object  beam  maslc 
apertures  was  redirected  with  a  mirror  and  combined  with  a 
portion  of  the  reference  beam  by  a  beamsplitter.  The 
combined  beams  entered  the  objective  of  a  microscope. 
Fringes  from  the  interference  of  the  two  beams  were  visible 
at  the  output  of  the  microscope.  The  focused  spherical 
wavefront  from  the  object  beam  and  the  essentially  plane 
wavefront  of  the  reference  beam  produced  circular  fringes. 
At  the  center  of  the  circular  pattern  was  a  spot  that 
changed  from  light  to  dark  as  the  relative  phase  of  the 
beams  varied  over  180  degrees.  An  aperture  was  positioned 
to  pass  light  from  only  this  spot  to  an  optical  power  meter. 
Thus,  power  detected  by  the  meter  gave  a  measure  of  the 


relative  phase  of  the  object  and  reference  beams.  It  was 
determined  that  the  electronics  of  the  phase  stabilization 
system  used  for  the  Exclusive  Or  experiments  would  also 
serve  to  stabilize  the  phase  of  the  beams  while  recording 
the  Hand  processing  holograms.  The  input  to  the  lock-in 
amplifier  was  taken  from  a  voltage  output  of  the  power  meter 
monitoring  the  interference  spot.  The  stabilization  system 
could  be  adjusted  to  control  the  phase  of  the  beams  to 
produce  either  minimum  or  maximum  power  in  the  monitored 
8 pot .  The  difference  in  these  two  states  corresponds  to  a 
change  in  the  relative  phase  of  the  beams  by  180  degrees. 
Two  values  of  relative  phase  of  the  beams  differing  by  180 
degrees  are  all  that  are  needed  to  record  the  Nand 
processing  holograms. 

A  typical  experiment  to  produce  Nand  processing 
proceeded  in  the  following  manner.  First  a  mask  pattern  was 
defined  by  placing  opaque  tape  over  some  of  the  apertures  of 
the  mask  plate.  Looking  in  the  direction  of  propagation  of 
the  object  beam,  the  leftmost  aperture  was  always  left  open. 
Light  passing  through  this  aperture  was  used  by  the  phase 
monitoring  system.  The  next  two  apertures  to  the  right  were 
always  covered.  Light  passing  through  them  would  have  been 
affected  by  the  mount  of  the  mirror  used  to  deflect  the 
phase  monitoring  beam.  The  next  aperture  to  the  right  was 
designated  the  reference  bit  for  the  patterns.  The. 
remaining  four  apertures  of  the  mask  were  available  for 
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defining  the  reference  pattern  to  be  holographically 
recorded,  and  later,  the  input  data  to  be  proceaaed.  The 
hologram  of  the  naak  pattern  was  recorded  in  the  crystal 
uaing  5  n W  of  optical  power  in  the  reference  bean,  3  jtW  of 
power  in  the  object  bean,  and  an  exposure  tine  of  60 
seconds.  The  large  apertures  of  the  input  nask  produced  a 

Si 

Fourier  transforn  at  the  crystal  that  was  nuch  sore  compact 
than  the  transforn  produced  by  the  nask  used  for  the 
Exclusive  Or  experiments.  Therefore,  lower  levels  of  power 
were  used  in  the  recording  beams  to  achieve  the  exposure 
used  for  the  Exclusive  Or  experiments.  The  input  mask 
(  pattern  used  to  record  the  first  hologram  had  oper.  apertures 

for  binary  ones  in  the  reference  pattern.  Apertures  in  the 
positions  of  binary  zeros  and  the  reference  bit  were 
blocked.  The  phase  stabilization  system  was  set  to  produce 

a  bright  interference  spot.  When  the  exposure  for  the  first 
hologran  was  complete,  the  input  mask  pattern  was 
complemented.  Apertures  in  the  positions  of  binary  ones 
were  covered  and  apertures  in  the  positions  of  binary  zeros 
and  the  reference  beam  were  uncovered.  The  phase 
stabilization  system  was  set  to  produce  a  dark  interference 
spot.  The  crystal  was  exposed  for  a  second  time,  using  the 
beam  power  levels  and  exposure  duration  of  the  first 
exposure,  xf  the  reference  data  pattern  being  recorded 
contained  more  than  a  single  binary  one,  additional  exposure 


of  fcho  reference  bit  was  required.  All  apertures  on  the 
input  mask  were  covered  except  the  reference  bit.  The  phase 
stabilisation  system  was  set  to  produce  a  dark  interference 
spot.  A  third  exposure  of  the  crystal  was  produced,  using 
the  same  beam  power  levels  used  for  the  first  two  exposures. 
The  exposure  time  was  N-l  times  the  duration  for  the  first 
two  exposures,  where  N  is  the  number  of  ones  in  the 
reference  pattern. 

The  reference  beam  itself  had  previously  been 
projected  through  the  optical  system  onto  the  vidicon 
photccathode,  so  the  position  at  which  the  output  optical 
bit  would  occur  was  known.  The  pattern  used  for  the  second 
holographic  exposure,  with  all  xero  bit  positions  uncovered, 
was  placed  in  the  object  beam  mask.  This  pattern  produced 
the  optical  output  bit  containing  the  largest  optical  power. 
The  reference  beam  was  blocked,  and  the  intensity  of  the 
object  beam  was  increased  until  illumination  due  to 
diffraction  was  observed  with  the  video  system. 

The  first  Nand  processing  results  obtained  were  not 
in  the  expected  form.  Instead  of  the  circular  bright  spot 
observed  on  the  video  cystem  when  the  reference  beam  itself 
was  projected,  the  output  of  the  Ncnd  operation  occured  as  a 
vertical  band  of  illumination.  The  band  was  as  broad  as  the 
expected  spot,  but  approximately  four  times  as  long,  and 
dimmed  toward  its  ends.  In  spite  of  distortion  of  the 
output,  a  Nand  result  was  observed.  If  the  pattern  of  the 
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Input  sank  was  arranged  to  natch  the  recorded  reference 
pattern,  a  dark  atripe  would  appear  in  the  middle  of  the 
band  of  output  light.  The  stripe  was  the  darkest  when  the 
input  pattern  exactly  matched  the  refence  pattern.  The 
stripe  would  take  on  varying  degrees  of  illumination  for 
other  patterns  present  as  the  input.  The  relative  degrees 
of  illumination  observed  appeared  to  correspond  well  to 
those  predicted  on  the  basis  of  the  principles  of  the  Nand 
processor.  Uncovering  an  aperture  on  the  input  mask  that 
corresponded  to  a  binary  one  bit  in  the  reference  pattern 
would  darken  the  observed  output  bit.  The  contribution  of 
an  additional  wavefront  with  a  phase  differing  by  160 
degrees  from  the  light  present  at  the  output  is  the  cause  of 
the  illumination  decreasing  at  the  output,  though  more  light 
was  present  in  the  input. 

Output  bits  that  appear  in  the  form  of  vertical 
stripes  would  limit  the  parallel  processing  potential  of  the 
Nand  form  of  processing.  The  fora  of  the  output  bits  was 
attributed  to  the  following  cause.  Holographic  fringes  are 
recorded  only  at  locations  in  the  crystal  where  the  object 
and  reference  beams  overlap.  As  stated  previously,  the 
relatively  large  apertures  in  the  input  data  mask  produced  a 
very  compact  Fourier  transform  at  the  crystal.  Therefore, 
the  extent  of  the  recorded  hologram  was  small  with  respect 
to  the  diameter  of  the  reference  beam.  The  transform  was 
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particularly  compact  in  tha  vertical  direction  aince  the 
input  aask  contained  leas  structure  in  that  direction. 
Light  diffracted  by  the  small  recorded  hologram  did  not 
accurately  represent  the  original  wavefront  of  the  reference 
bean.  Instead,  the  reconstructed  wavefront  produced  a  beam 
that  expanded,  particularly  in  the  vertical  direction. 

To  expand  the  Fourier  transform  pattern  of  the  input 
mask,  a  diffusing  screen  was  used.  This  is  a  piece  of 
transparent  material  placed  in  the  object  beam  directly 
before  the  input  mask.  The  diffusing  screen  introduces 
phase  variations  with  high  spatial  frequency  content  onto 
the  object  beam  wavefront.  The  high  spatial  frequencies 
result  in  an  expanded  Fourier  transform  at  the  crystal. 
Many  types  of  material  were  tried  for  the  diffusing  screen 
to  match  the  size  of  the  Fourier  transform  to  the  diameter 
of  the  reference  beam.  The  best  result  was  obtain  by  using 
a  piece  of  transparent  adhesive  tape  affixed  to  the  front 
surface  of  the  mask.  The  tape  covered  all  apertures  on  the 
mask  except  the  aperture  used  in  conjunction  with  the  phase 
monitoring  system.  Thickness  variations  in  the  celluloid  of 
the  tape  provided  the  required  phase  variations  in  the 
object  beam  wavefront. 

The  new  form  of  the  Fourier  transform  required  that 
the  power  in  the  recording  beams  be  increased  to  60  pK  in 
both  the  object  and  the  reference  beams  to  obtain  practical 
holograms.  The  improvement  observed  in  the  form  of  the 


1 


114 


output  light  from  Nand  operations  with  the  diffusing  screen 
in  place  were  draaatic.  The  optical  output  bit  closely 
resembled  the  sise  and  shape  of  the  illuaination  produced  by 
the  reference  beam  itself. 

Figure  31  shows  the  results  of  an  optical  Nand 
operation  using  a  recorded  reference  pattern  of  0010. 
pigure  31a  shows  the  binary  zero  result  obtained  with  the 
input  data  pattern  aatching  the  reference  data  pattern. 
Only  a  snail  anount  of  light  is  present  in  the  null.  Figure 
31b  shows  the  light  present  when  the  input  data  pattern 
differs  froa  the  reference  pattern  by  one  bit.  Figure  31d 
shows  the  largest  possible  output  Intensity  in  response  to 
an  input  pattern,  1101}  that  is  the  coapleaent  of  the 
reference  pattern.  Figure  31c  shows  the  decrease  in  the 
intensity  when  the  input  aask  aperture  at  the  position  of  a 
binary  one  in  the  reference  pattern  is  uncovered. 

The  principle  underlying  operation  of  the  Nand 
optical  processor  has  been  demonstrated  in  the  experiments 
described  above.  Production  of  the  optical  Nand  operation 
was  aore  difficult  than  production  of  an  optical  Exclusive 
Or  operation,  but  the  potential  usefulness  of  the  Nand  fora 
of  optical  processing  aakes  it  worth  pursuing.  Three 
iaportant  results  froa  the  series  of  Nand  processing 
experiments  are  givem  First,  care  must  be  taken  to  provide' 
sufficient  optical  power  passing  through  the  input  data  aask 
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so  that  a  detectable  amount  of  power  will  be  diffracted  by 
the  holograms  onto  the  output  detector.  The  minimum  power 
passing  through  the  object  beam  mask  occurs  when  all  input 
bits  are  binary  zeros*  leaving  only  the  reference  bit 
aperture  transparent.  Second*  the  critical  task  of 

monitoring  and  controlling  the  relative  phase  of  the  object 
and  reference  beams  while  recording  Nand  processing 
holograms  can  be  accomplished  with  the  phase  stabilization 
system  described.  Finally*  attention  must  be  given  to  the 
area  of  overlap  of  the  recording  beams  at  the  crystal. 
Reconstruction  of  a  plane  wave  with  the  Fourier  transform  of 
a  mask  pattern  is  more  complicated  than  the  common  practice 
(  of  reconstructing  the  Fourier  transform  of  a  mask  with  a 

plane  wave.  Using  a  diffusing  screen  in  front  of  the  mask 
pattern  produces  the  benefical  results  of  giving  a  broader 
and  more  uniform  illumination  in  the  optical  Fourier 
transform  produced  at  the  crystal. 
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CHAPTER  VI 


CQMCLHSIOHS 

The  purpose  of  this  chapter  is  to  summarize  the 
original  contributions  made  to  the  field  of  digital  optical 
processing  by  the  work  presented  in  this  thesis.  Also, 
suggestions  are  given  for  additional  research  to  build  on 
the  results  obtained. 

Optical  Digital  Parallel  Processing  Principles 

The  operating  principles  of  two  forms  of  optical 
digital  parallel  processing  have  been  presented.  Both  forms 
of  processing  are  based  on  Boolean  logic  operations 
optically  produced  by  lightwave  interference  at  the 
detector  plane.  The  primary  optical  logic  operation  of  one 
form  of  processing  is  Exclusive  Or;  the  primary  optical 
logic  operation  of  the  other  form  is  Hand.  Both  forms  of 
processor  utilize  information  taken  from  the  logical 
truth-table  of  the  digital  operation  performed.  The 
information  is  stored  as  thick  phase  holograms  in  lithium 
niobate  crystals.  The  processors  perform  the  function  of 
searching  in  parallel  for  stored  reference  patterns  that 
match  the  input  data  to  be  processed.  The  presence  or 
absence  of  matches  determines  tho  state  of  the  binary  output 
bits  of  the  processors.  Multiple  streams  of  input  data  may 
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be  processed  simultaneously  by  accessing  the  recorded 
holograas  from  different  input  angles,  displaced  from  the 
plane  of  the  holograa  recording  beaas. 

The  optical  processing  systems  presented  are  the 
first  known  to  coabine  the  flexibility  of  digital 

truth-table  look-up  operation  with  the  power  of  full  word 
parallel  operation.  The  speed  of  operation  of  the 

processing  systems  is  essentially  liaited  only  by  the  cycle 
tiaes  of  the  input  and  output  devices.  The  practical 
feasibility  of  iapleaenting  the  processing  systems  has  been 
supported  in  two  ways.  First,  a  computer  study  of  the  size 
of  the  required  truth-tables  has  shown  that  the  amount  of 
(.  information  stored  for  producing  useful  digital  operations 

is  within  the  previously  deaonstrated  capacity  of 
holographic  storage  systems  using  electrooptic  crystals  as 
the  storage  medium.  Second,  the  optical  logic  operations 
underlying  the  processing  systems  have  been  demonstrated 
with  experimental  results. 

Truth-Table  Reduction 

To  lend  support  to  the  feasibility  of  the  optical 
processing  systems,  the  size  of  truth-tables  that  would  be 
stored  to  perform  some  useful  computations  have  been 
tabulated.  The  optical  processing  systems  can  perform  the 
function  of  identifying  stored  patterns  that  match  the  input 
(  data  even  when  the  stored  patterns  contain  "don't  care" 
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positions.  Logical  reduction  of  truth-table  information 
prior  to  incorporating  it  into  the  processing  systems  was 
determined  to  have  a  significant  effect  on  the  size  and 
complexity  of  digital  operations  that  can  be  accommodated 
within  a  limited  system  capacity. 

The  work  of  logical  reduction  of  truth-tables 

required  that  advanced  algorithms  be  identified  in  the 
literature,  and  then  realized  as  computer  programs.  An 
improvement  of  the  algorithm  used  for  determination  of  the 
prime  implicants  of  a  truth-table  was  discovered  in  the 
course  of  this  work.  The  execution  speed  and  data  storage 
efficiency  of  the  algorithm  were  increased  by  the 
(  modification.  Logical  truth-tables  containing  up  to  16 

input  variables  were  successfully  reduced.  The  tabulated 
sizes  of  reduced  truth-tables  are  of  interest  for  VLSI 
design  as  well  as  optical  truth-table  look-up  processing. 

The  flexibility  of  operation  of  the  optical 
truth-table  look-up  processing  systems  allows  the  use  of  any 
convenient  binary  representation  of  the  processed  data.  A 
Binary  Coded  Residue  number  representation  system  was 
particularly  useful.  The  absence  of  interdependence  of  the 
digit  positions  of  numbers  in  that  system  allows  the  use  of 
both  reduced  and  unreduced  truth-tables  that  are 
substantially  smaller  than  corresponding  truth-tables  for 
the  standard  binary  number  system. 

The  numerical  operations  of  addition  and 
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multiplication  were  selected  for  study.  They  are  central  to 
many  digital  processing  applications  in  need  of  parallel 
operation.  Using  previously  demonstrated  holographic 
storage  capacities  for  electrooptic  crystals,  and  results  of 
the  truth-table  reduction  and  residue  number  system  studies 
described  above,  the  determination  was  made  that  addition  or 
multiplication  of  binary  numbers  with  eight  bits  of 
precision  can  be  accomplished  in  a  single  pass  through  the 
optical  system.  Results  of  any  desired  precision  can  be 
obtained  with  multiple  processing  steps. 

Many  opportunities  exist  for  further  research  into 
truth-table  reduction  and  alternate  number  systems. 
Investigation  of  advanced  truth-table  reduction  algorithms 
would  support  areas  other  than  optical  digital  processing. 
The  same  is  true  of  advances  in  the  state  of  knowledge  about 
residue  number  representations,  .  which  are  also  under 
consideration  for  high  speed  electronic  digital  designs. 
Application  of  error  detection  and  correction  techniques 
known  for  -  residue  number  systems  to  the  Exclusive  Or  and 
Nand  processing  systems  would  be  valuable.  Truth-table 
look-up  processing  accrues  an  advantage  over  other  forms  of 
processing  as  the  complexity  of  the  implemented  operation 
increases.  The  impact  of  performing  operations  other  than 
addition  and  multiplication  on  the  optical  processing 
systems  is  worthy  of  study. 


Experimental  Results 

Experimental  studies  of  the  principles  of  the  optical 
processing  systems  were  carried  out  to  demonstrate  the 
practical  feasibility  and  to  identify  factors  influencing 
reliable  processing  operation.  In  support  of  this  thesis, 
the  logical  Exclusive  Or  of  imaged  and  holographically 
recorded  two-dimensional  pages  of  binary  data  was  produced. 
There  is  no  previous  report  of  this  operation  including 
thick  holographic  recordings  and  the  use  of  lightwave  phase 
interference  at  the  detector  plane.  A  uniform  Exclusive  Or 
result  routinely  was  achieved  over  all  data  apertures. 

Factors  affecting  the  performance  of  the  Exclusive  Or 
processing  system  were  identified.  First,  lack  of 
registration  between  direct  and  reconstructed  data  page 
images  remains  an  unexplained,  but  easily  corrected  effect. 
The  importance  of  providing  for  equivalent  spatial  frequency 
content  in  the  direct  and  reconstructed  data  page  images  has 
been  observed.  The  size  of  the  reference  beam  must  be 
■Mtched  to  the  size  of  the  optical  Fourier  transform  of  the 
data  mask.  Alternatively,  an  aperture  can  be  used  at  the 
Fourier  transform  plane  to  limit  the  spatial  frequency 
content  of  both  the  direct  and  reconstructed  images.  The 
stability  of  the  phase  relationship  between  the  object  and 
reference  beams  was  a  major  cause  of  degradation  in  the 
Exclusive  Or  results.  A  phase  stabilization  feedback  system 


va s  implemented  to  enhance  the  resistance  of  the  processing 
system  to  phase  drift  effects. 

A  preliminary  measurement  of  the  bit  error  rate  of 
the  Exclusive  Or  processing  system  was  made.  This  was  done 
by  fitting  statistical  samples  of  powers  incident  on  a 
modeled  array  of  square  output  detector  elements  to 
appropriate  probability  density  function  curves.  On  the 
basis  of  these  curves  a  threshold  value  of  power  was 
determined  that  allowed  calculation  of  a  total  probability 
of  error  foi  the  processing  operation.  The  total 
probability  of  error  value  calculated  was  4.72xl0“4  .  This 
figure  is  considerably  higher  than  bit  error  rates  for 
existing  electronic  digital  equipment.  But,  for  a  system 
whose  operating  principles  have  only  just  been  demonstrated, 
this  figure  need  not  be  disappointing.  Continued  attention 
to  improving  the  factors  noted  above  that  influence 
operation  of  the  processing  system  should  provide 
considerable  reduction  of  the  probability  of  error. 

The  operating  principles  of  the  Nand  form  of  optical 
processing  were  also  demonstrated.  This  is  the  first 
reported  demonstration  of  an  optical  Nand  operation  using 
holographic  patterns  to  define  the  phase  relationship  of  the 
contributing  beams. 

Factors  affecting  operation  of  the  Hand  processor 
were  identified.  The  need  for  appreciable  power  in  the 
object  beam  for  data  processing  was  explained.  The  phase 


stabilisation  system  designed  for  uae  in  the  Exclusive  Or 
processing  experiments  was  modified  to  monitor  and  control 
the  relative  phase  of  the  beams  recording  holograms  for  the 
Nand  processing  system.  Finally,  matching  the  size  of  the 
Fourier  transform  of  the  input  data  mask  tc  the  size  of  the 
reference  beam  was  found  to  be  important  to  maintain  the 
fidelity  of  the  reconstructed  reference  beam  wavefront.  A 
diffusing  screen  was  used  in  front  of  the  object  mask  to 
enlarge  the  Fourier  transform  to  match  the  size  of  the 
reference  beam. 

Design  of  practical  Exclusive  Or  and  Nand  optical 
processing  systems  provides  a  rich  field  for  continuing 
theoretical  and  experimental  investigation.  Theoretical 
studies  modeling  the  effects  of  the  factors  found  to 
influence  processor  operation  will  guide  development  of 
practical  processing  systems.  Investigations  of  the 
engineering  trade-offs  involved  in  selecting  the  diameter  of 
the  reference  beam  and  the  input  mask  Fourier  Transform 
would  be  extremely  useful.  Also,  determining  the  best 
spacing  for  output  bits  at  the  detector  plane  is  an 
appropriate  topic  for  theoretical  investigation.  A 
theoretical  analysis  of  the  properties  of  .  holographic 
gratings  formed  with  one  or  both  beams  encoded  with  spatial 
information  is  needed.  A  primary  goal  for  experimental  work 
should  be  the  demonstration  of  each  of  the  processing 
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systems  in  parallel  operation.  An  investigation  of  the 
effect  the  fixing  process  for  lithium  niobate  crystals  has 
on  processor  holograms  is  also  needed.  Also,  investigation 
of  the  use  of  other  thick  holographic  recording  materials, 
such  as  diebromated  gelatin,  might  be  considered. 


( 


.  -s-i:  .via.  :  -  =  ' 

’•***  -  V!'  ‘  -i 

•■*.%£•*£  :.v. 

;•  '-I- .  -&■ :  'H' 


v  APPENDIX  1 

On  the  following  pages  are  the  results  of  the 
computer  study  of  the  sizes  of  reduced  and  unreduced 
truth-tables.  The  numerical  operations  of  addition  and 
multiplication  are  included.  Also,  the  standard  binary  and 
the  Binary  Coded  Residue  number  systems  are  included.  The 
number  of  truth-table  entries  needed  for  each  output  bit 


individually  is  available. 
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Table  Al-1.  Tot"!*  number  of  reference  date  pattern*  t  >  be  recorded  to 
cor  .rucc  the  unity-reault  truth  table*  for  direct  binary 
it* on  and  multiplication  of  two  n-bit  number*. 

n 

Addition 

n+1 

llA* 

Multiplication 

2n 

l\ 

l 

1 

8 

1 

2 

48 

14 

3 

256 

111 

4 

1,280 

678 

5 

6,144 

3,733 

6 

28,672 

18,953 

*  7 

131,072 

92,334 

8 

589,824 

434,660 

127 


Table  Al-2. Dunbar  of  combinations  of  Inputs  that  produce  "ones"  In  each 

bit  position  of  the  answer  resulting  from  the  logically-reduced 
direct  binary  addition  of  two  n-blt  numbers  with  an  input  carry, 
corresponds  to  the  least  significant  digit,  etc. 


n 

A1 

A2 

A3 

A4 

A5 

A6 

*7 

*9 

*AI 

1 

4 

3 

- 

- 

- 

- 

- 

- 

- 

7 

2 

4 

12 

7 

- 

- 

- 

- 

- 

- 

23 

3 

4 

12 

28 

15 

- 

- 

- 

- 

- 

59 

4 

4 

12 

28 

60 

31 

- 

* 

- 

- 

135 

5 

4 

12 

28 

60 

124 

63 

- 

- 

- 

291 

6 

4 

12 

28 

60 

124 

252 

127 

- 

- 

607 

7 

4 

12 

28 

60 

124 

252 

508 

255 

- 

1243 

8 

4 

12 

28 

60 

124 

252 

508 

1020 

511 

2519 

130 
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Table  Al-5.  Humber  of  combinations  of  inputt  that  produce  "cnee"  in  each 
bit  poaition  of  the  answer  resulting  from  the  binary-coded 
residue  arithmetic  addition  of  two  numbers.  The  input 
numbers  are  represented  by  their  residues  with  respect  to 
moduli,  M. 


M 

A1 

A2 

A3 

A4 

iAl 

2 

2 

- 

- 

- 

2 

3 

3 

3 

- 

- 

6 

4 

8 

8 

- 

- 

16 

5 

10 

10 

5 

- 

25 

6 

18 

12 

12 

- 

42 

7 

21 

21 

21 

- 

63 

9 

36 

36 

36 

9 

117 

11 

55 

55 

44 

33 

187 

13 

78 

78 

65 

65 

286 

5V7/ 
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Table  Al-6 .  Number  of  combinations  of  Inputs  that  produce  "ones"  In  each 
bit  position  of  the  answer  resulting  from  the  logically- 
reduced  binary-coded  residue  arithmetic  addition  of  two 
numbers.  The  Input  numbers  are  represented  by  their  residues 
with  respect  to  various  moduli,  M. 


M 

A1 

A2 

A3 

A4 

*A, 

2 

2 

- 

- 

- 

2 

3 

3 

3 

- 

- 

6 

4 

2 

6 

- 

- 

8 

5 

8 

6 

S 

- 

19 

6 

8 

9 

9 

- 

26 

7 

12 

12 

12 

- 

36 

9 

20 

17 

18 

9 

64 

n 

23 

29 

22 

18 

92 

13 

33 

29 

31 

26 

119 
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Table  Al-7.  Nuaber  of  combination*  of  input*  that  produce  "one*"  in  each 
bit  position  of  the  answer  renulting  from  the  binary-coded 
residue  arithmetic  multiplication  of  two  number*.  The  input 
number*  are  represented  by  their  residue*  with  respect  to 
moduli ,  M. 


A1 

*2 

A3 

A4 

IAI 

1 

- 

- 

- 

1 

2 

2 

- 

- 

4 

4 

6 

- 

- 

10 

8 

8 

4 

- 

20 

9 

11 

8 

- 

28 

18 

18 

18 

- 

54 

16 

24 

28 

- 

68 

30 

36 

30 

6 

102 

50 

50 

40 

30 

170 

72 

72 

60 

60 

264 

s-m 
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Table  Al-8.  Number  of  combinations  of  Inputs  that  produce  "ones"  In  each 

bit  position  of  the  answer  resulting  from  the  logically-reduced 
binary-coded  residue  arithmetic  multiplication  of  two  numbers. 
The  Input  numbers  are  represented  by  their  residues  with  re¬ 
spect  to  various  moduli,  M. 


H 

A1 

A2 

A3 

A4 

IA, 

2 

1 

- 

- 

- 

1 

3 

2 

2 

- 

- 

.4 

4 

1 

4 

- 

- 

5 

5 

7 

4 

4 

- 

15 

6 

4 

9 

6 

- 

19 

7 

6 

6 

6 

- 

18 

8 

1 

4 

9 

-■ 

14 

9 

17 

17 

15 

6 

55 

11 

22 

22 

21 

19 

84 

13 

34 

25 

29 

28 

116 
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APPENDIX  2 


Xhe  PhQtQEsfractive  Effect 

The  principles  of  the  optical  processing  systems 
presented  in  this  thesis  are  predicated  on  thick  holographic 
recordings.  The  holographic  recording  material  chosen  for 
the  thesis  experiments  was  crystalline  lithium  niobate 
(LiNbOj) .  The  mechanism  in  lithium  niobate  responsible  for 
recording  holographic  fringes  is  the  photoref ractive  effect. 

The  photoref ractive  effect  is  optically-induced 
changes  in  the  index  of  refraction  of  a  material.  The 
effect  results  from  the  combination  of  two  processes: 
optically-induced  charge  migration,  followed  by  electrooptic 
modulation  of  the  index  of  refraction.  The  dominant  cause 
of  photorefractive  charge  migration  in  lithium  niobate  is 
electron  transport  produced  by  the  bulk  photovoltaic  effect. 
Other  possible  causes  of  charge  migration  are  concentration 
gradient-induced  diffusion  and  electric-field-induced  drift. 

The  bulk  photovoltaic  effect  may  occur  in  all 
crystals  with  structures  lacking  inversion  symmetry. 
Previous  opinion  held  that  a  crystal  must  be  ferroelectric 
to  exhibit  the  bulk  photovoltaic  effect,  but  bismuth  silicon 
oxide  (BSO)  has  provided  a  counter  example.  That  crystal  is 
not  ferroelectric,  due  to  the  cubic  structure  cf  its 


lattice,  but  is  weakly  bulk  photovoltaic.  Tne  bulk 
photovoltaic  ef£ect  has  also  been  called  "anomalous 
photovoltage"  or  "photogalvanic  effect." 

The  bulk  photovoltaic  effect  was  first  described  by 
Glass  et.  al.  [45]  in  1974.  It  is  a  stationary  bulk 
phenomenon  and  is  characterized  by  a  short-circuit  current 
that  is  produced  under  uniform  illumination.  It  is  <* 
anisotropic  effect  described  by  a  third  rank  tensor 

relation.  The  bulk  photovoltaic  effect  differs  from 
ordinary  photovoltaic  effect,  seen  in  P-N  junction  devices, 
in  that  the  current  density  is  not  expressable  as  the 
gradient  of  an  electrochemical  potential  of  the  charge 
carriers.  The  open-circuit  voltage  developed  by  the  bulk 
photovoltaic  effect  is  not  limited  by  the  band  gap  of  the 
material.  In  iron-doped  lithium  niobate,  the  open-circuit 
voltage  may  approach  10  kilovolts.  The  short-circuit 
current  density  is  about  J=10-10  Amps/mm2  for  an  incident 
optical  intensity  of  0.01  watt/mm2.  The  current  density  is 
given  bys 


J  “  kbI 


(A2-1) 


where  *  is  the  photovoltaic  coefficient  (sometimes  called 
the  Glass  constant) ,  a  is  the  optical  absorption,  and  I  is 
the  optical  intensity. 

The  origin  of  the  bulk  photovoltaic  effect  was 
recognized  by  Glass  to  be  spatially  asymmetric  electronic 


transition  probabilities  that  result  from  the  asymmetric 
potential  energy  well  associated  with  an  electron  donor 
impurity. 

For  lithium  niobate,  the  probabilty  of  an  excited 
electron  moving  in  the  optic  axis  direction  is  greater  than 
that  of  it  moving  in  the  opposite  direction.  The  optic  axis 
is  oriented  perpendicular  to  the  fringes  of  the  light 
interference  pattern  that  is  produced  by  the  object  and 
reference  beams  during  recording.  Electrons  are 
photoexcited  in  the  bright  regions  of  the  fringe  pattern. 
They  are  driven  by  the  bulk  photovoltaic  effect  in  a 
direction  along  the  optic  axis.  After  travelling  a  distance 
determined  by  their  recombination  lifetime,  the  electrons 
become  retrapped,  if  retrapping  occurs  where  the  light 
intensity  is  small,  the  probability  of  re-excitation  will  be 
low.  since  the  bright  and  dark  fringes  are  approximately 
one  optical  wavelength  apart,  the  electron  migration  occurs 
very  rapidly.  The  spatial  variation  of  electron 
concentration  produced  is  a  replica  of  the  light  intensity 
interference  fringe  pattern.  The  electric  field  associated 
with  this  space  charge  pattern  modulates  the  index  of 
refraction  of  the  material  through  the  electrooptic  effect. 
The  refractive  index  modulation  also  mimics  the  light 
interference  pattern  and  forms  a  phase  hologram. 

Crystals  tnat  exhibit  the  bulk  photovoltaic  effect 
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also  exhibit  the  linear  electrooptic  effect.  Both  effects 
require  a  lack  of  inversion  symmetry  in  the  crystal 
structure.  Crystals  that  have  inversion  symmetry  also  can 
exhibit  the  photoref ractive  effect.  For  such  crystals, 
charge  transport  is  by  diffusion  or  by  drift  due  to  an 
electric  field.  The  quadratic  electrooptic  effect  modulates 
the  index  of  refraction  in  these  cases. 

For  the  linear  electrooptic  effect  found  in  lithium 
niobate,  the  amplitude  of  the  refractive  index  modulation 
for  light  propagation  in  the  y-axis  direction  and  light 
polarization  in  the  z-axis  (optic  axis)  direction  is  given 
byi  * 

C  An  -  -n^r33El/2  (A2-2) 

where  nE  is  the  principal  extraordinary  index  of  refraction, 
tj3  is  the  appropriate  electrooptic  coefficient  for  this 
geometry,  and  E(  is  the  amplitude  of  the  space  charge  field. 
The  resulting  diffraction  efficiency  from  Kogelnik's 
first-order  two-wave  coupled-wave  theory  is  given  by: 

DE  “  »in^(nAnd/Xcos8)  (A2-3) 

where  d  is  the  thickness  of  the  crystal,  x  is  the  fieespace 
wavelength,  and  e  is  the  angle  of  refraction  inside  the 
crystal  for  the  Incident  reconstruction  beam.  For  0.1% 
(  diffraction  efficiency  holograms  and  typical  laboratory 


5-Ht 


parameters,  the  refractive  index  modulation  amplitude  is 
an«10~*  .  This  corresponds  to  an  electric  field  amplitude  of 
E  »  6x1©’  voltd/m 

An  estimate  of  the  number  of  superposed  holograms 
that  may  be  recorded  can  be  obtained  by  assuming  that  at 
some  point  in  the  material,  all  of  the  index  modulations  are 
in  phase  and  add  to  use  the  material's  entire  available 
dynamic  refractive  index  range,  AN.  The  number  of  possible 
superposed  holograms  would  then  be  given  by  an/ an.  For 
lithium  niobate,  AN*10~3  .  Thus  for  the  practical  example 
given,  the  maximum  number  of  holograms  that  can  be  recorded 
is  about  1000. 
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Dependent  and  Independent  Conetreinte  for  e  Multiple 
Objective  Iterative  Algorithm 

Joeeph  H.  Mait  end  William  T.  Abode* 

Georgia  Institute  of  Technology 

School  of  Electrical  Engineering 
Atlanta,  Ceorgia  30332 

Constrained  iterative  algorithm*  have  been  applied  primarily  to  single 
objective  application*,1  where  by  objective  we  mean  that  distribution  that  ie 
to  be  reconstructed  from  partial  information  or  synthesised  with  desired 
characteristics.  In  a  recent  work  by  the  authors,2  Fienup's  error-reduction 
algorithm  was  extended  to  multiple  objectives,  as  shown  in  Fig.  1,  and  was 
applied  to  a  specific  synthesis  problem  with  two  objectives.  In  this  paper, 
we  illustrate  an  important  characteristic  of  multiple  objective  iterstive 
algorithms,  dependent  constraints. 

Our  problem  is  the  synthesis  of  two  pupil  functions  ?j(u)  and  ?2(u),  for 
use  in  the  incoherent  optical  apatial  filtering  system  in  Fig.  2,  such  that  a 
desired  bipolar  spatial  impulse  response  or  point  spread  function  (FSF) 
results.  The  effective  pupil  function  Ku;A|,Aj,f)  of  the  optical  system  in 
Fig.  2  is  given  by3 

Ku-.Aj.Aj.d)  -  A.PjU)*^*  AjFjtu),  Hi 

and  the  corresponding  FSF  f(x;A^,A2,t)  by 
f(x;A1(A2,d)  -  |p(x;Aj ,Aj ,♦) |2 

-  Aj|pj(*)|2  ♦  A^|p2U)|2  (2) 


♦  AjAjIpjixJpjCxle3*  ♦  p*(x)p2(x)e  ^*1 

where  the  pupil  function  F(u)  and  the  coherent  spread  function  (CSF)  p(x)  form 
a  Fourier  transform  pair.  A  desired  bipolar  FSF  £(x)  may  be  synthesised 
through  control  of  transmittance  factor*  Aj  and  Aj  and  phase  ♦  . 

Lohmann  and  Rhodes  identify  two  distinct  regime*  for  implementing  bipolar 
FSFs  in  this  way,  pupil  interaction  and  pupil  noninteraction. *  The 
synthesised  FSF  f#(x)  resulting  from  pupil  noninteraction  is  given  by 

f#(x)  -  A2|pj(x)|2  -  A^|p2(x)j2,  (3> 

and  for  pupil  interaction  by  (where  dj(x)  “  arg  {pi(*)}»  1*1,2) 

f^tx)  ■  2A^A2|pj(x)pj(x)  HcoslS^  ♦  *j(x)  -  djU))  (A) 

-  C0»[*b  ♦  #l<x)  -  *2(x)Jj 


(cr! 
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where  *  and  *.  ara  two  different,  but  fixed,  values  of  phaae  ♦  in  Fig.  2. 

A  D 

Since  it  ia  the  pupil  functions  Pj(u)  and  Pj(u)  that  describe  the  system, 
they  are  our  objective  functions.  From  a  practical  optical  standpoint,  the 
pupil  functions  must  be  of  finite  extent;  thus  we  desire  the  following  of  our 
systea: 


I.  synthesis  of  a  bipolar  FSF 

ft(x)  -  f(x),  where  f(x)  is  the  desired  bipolar  FSF. 

II.  finite  extent  pupil  functions 

P1  2(u)  “  P1  2<u)  rect  0,-**rev  i§  th*  e*tent  #f  the  PuPil‘ 

With  respect’ to  Eqs.  (3)  and  (4),  Condition  I  is  a  dependent .  or  mutual 

constraint,  since  both  pj(x)  and  p2(x)  must  jointly  satisfy  the  constraint. 

This  is  in  contrast  to  Condition  II,  where  the  constraint  on  Fj(u)  in  no  way 

determines  or  affects  the  constraint  on  F2(u).  Understandably,  greater 

freedom  exists,  and  more  ingenuity  may  be  required,  to  satisfy  a  dependent 

constraint  as  opposed  to  an  independent  constraint  because  there  are  no 

explicit  constraints  on  the  objectives. 

By  definition  of  the  error-reduction  algorithm,  a  point  not  satisfying 
the  domain  constraints  is  replaced  by  a  point  that  satisfies  the  constraint 
and  ie  a  minimum  distance  from  the  original  point.1  Figure  3  is  a  vector 
diagram  depicting  the  minimum  changes  necessary  to  assure  Condition  I.  It  is 
assumed  that  the  desired  FSF  fix)  is  dependent  equally  upon  pj(x)  and  p2(x); 
thus  modif ications  to  one  are  equal  and  opposite  to  modifications  of  the  other 
as  long  as  Condition  I  is  msintsined.  Condition  I  is  therefore  satisfied  for 
the  pupil  noninteraction  regime,  assuming  4j  ■  Aj  »  1,  by 


!*!<*>  I 
If2<x)  I 


■\Li  i&EjjpE**;. 
■jm,  '>■<»>'* 


(5a) 

(5b) 


where  POSig(x)]  is  a  half-wave  rectification  of  g(x).  The 
undisturbed.  For  the  interaction  regime,  assuming  the  modified 
o  e^6,  2A^Aj  ■  1/2,  *a  -  0,  and  *b  -  B,  Condition  I  is  satisfied  by 


•j(x) 


♦jU)  ♦  d2(x)  ♦  • 


phase 

point 


is 

is 


(o'. Z 


(6a) 
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,  #.(*)  ♦  *,(*)  -  • 

♦  2t*>  -  — - y - •  (6b> 


Algorithms  implemented  using  Eqs.  (5)  and  (6)  were  toted 

experimentally.  With  the  bandpass  filter  in  Fig.  4  aa  the  deaired  bipolar 

PSF,  Fife.  5  and  6  repreaent  pupil  noninteractive  and  pupil  interactive 

synthesis  of  the  FSF  after  100  iterationa  uaing  Eqs.  (5)  and  (6), 

respectively.  The  no realized  equared  error  ia  0.0875  for  the  noninteractive 

regiae  and  0.5137  for  the  interactive  regime. 

The  high  error  for  the  interactive  regiae  may  reault  froa  the  great 

amount  of  freedom  the  algorithm  preeenta;  although  the  error  ia  reduced  with 

each  iteration,  the  reduction  ia  alight.  For  thia  reaaon,  the  algorithm  vaa 

modified  to  force  the  ayntheaised  point  equal  to  the  deaired,  the  addition  of 

&2  to  fg  in  Fig.  3b,  aa  oppoaed  to  altering  ita  projection  onto  the  real 

axis,  the  addition  of  .  In  the  limit  of  a  large  number  of  iterationa, 

A  and  A.  ahould  be  equal.  The  reaulta  of  thia  algorithm  are  preaented  in 
12 

Fig.  7.  The  normalised  error  ia  0.0276. 
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A  new  optical  method,  based  on  continuous-tisie  relaxation  methods,  is  presented  for 
implicitly  inverting  the  estimate  of  the  covariance  matrix  associated  with  a  set  of  signal 
waveforms.  Complex  valued  signal  information  in  conveyea  by  biased  temporal  frequency 
carriers,  a  resonant  electro-optic  device  serving  both  to  evaluate  the  covariance  matrix  end 
(as  a  spatial  light  modulator,  or  SLM)  to  input  that  matrix  for  inversion. 

Introduction 

We  present  here  the  theory  underlying  a  new  incoherent  optical  approach  to  inverting 
the  covariance  matrix  estimate  associated  with  a  set  of  signal  waveforms.  The  scheme,  based 
on  a  continuous-time  algorithm  ana  using  biased  temporal-frequency  carriers  to  convey 
bipolar  or  complex-valued  signal  information,  has  certain  potential  advantages  over 
previously  reported  discrete  Iterative  approaches!  (1)  all  computation  is  analog,  and  A/O 
and  0/A  bottlenecks  are  thus  eliminated  at  all  stages  of  processing,  and  (2)  the  method 
avoids  dynamic  range  complications  that  might  arise  from  biased  real-imaginary  component  or 
three-component  representations  of  complex  signals. 

We  begin  by  presenting  the  essential  features  of  the  relaxation  method  for  implicit 
matrix  inversion  and  then  describe  modifications  appropriate  for  icplementatlon  with 
complex-valued  signals  conveyed  on  temporal  frequency  carriers.  Wo  then  present  the  basic 
a.c.  electro-optical  processor  concept.  Finally,  potential  difficulties  with  the  concept 
are  discussed,  along  with  suadif icationa  necessary  to  correct  them. 

Matrix  Inversion  by  Relaxation  Method 

Assume  the  matrix  equation 
v  hx 

(1) 

is  to  be  implicitly  inverted,  i.e.,  vector  jk  is  to  be  solved  for  in  terns  of  x*  (Consis¬ 
tently  throughout  this  paper,  lower-case  letters  with  underbara  denote  vectors,  and  upper¬ 
case  letters  with  underbars  denote  matrices.)  Incoherent  optical  processors  have  been  built 
that  solve  for  x  via  the  iterative  operation 

**♦1  *  X  +  <1  -  !»>**.  (2) 

where  X  denotes  the  identity  matrix  (1J.  Complex  arithmetic  has  been  implemented  using 
nonnegative-real  three-component  or  biased  real-imaginary  representations  for  the  matrix  and 
vectors,  allowing  optical  implementation  uaing  light  intensity  as  the  carrier  of  informs- 
t  lone 

alternative  method  for  implicit  inversion  of  the  matrix  is  based  on  contlnuous-tiste 
•Iflorlthms.  Such  methods  wers  recently  proposed  for  electro-optic  implementation 
by  Cheng  and  Caulfield  (21.  The  basic  idea  is  illustrated  by  the  system  of  Figure  1,  which 
is  configured  to  solve  the  pair  of  equations  (all  values  can  be  complex) 

*1  ’  “ll'l  ♦  "12*2  Oa> 

y2  -  mJ1«1  ♦  mMx2  (3b> 

f?*;  *1'  *2  ?‘v*n  Xl>  y £--i.e.,  to  solve  for  vector  x  in  Eq.  (1).  Inspection  <>»  the  figure 
,  *°  configured  that  If  equilibrium  is  attained,  then  x,  and  x;  must 
satisfy  Eq.  (3).  Generalixation  to  the  NxN  case  is  straight-forward. 
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As  discussed  in  (2),  the  systss  of  Figure  i  is  stsbie  only  when  matrix  jk  is 
positive-definite,  i.e.,  when  sli  the  eigenvalues  of  H.  hav*  reel  parts  that  are  positive 
(see,  e.q. •  (3),  Ch.  6).  The  significance  of  this  condition  is  illustrated  by  the  simple 
scalar  (non-vectorial)  example 

y  -  mx,  («) 

where  the  objective  is  to  determine  scalar  x  given  scalars  y  and  m.  All  three  quantities 
can  be  complex-valued.  A  conditionally  stable  system  for  solving  this  equation  is  shown  in 
figure  2.  Taking  Laplace  transforms  of  the  different  quantities  in  the  aysten  of  Figure  2 
and  rearranging  tarns  we  can  express  system  operation  in  terms  of  the  equation  (assuming 


gain  G  ■  1) 

X(s>  •  (l/(s+m)|Y(s).  (S) 

To  investigate  the  behavior  of  the  system,  we  let  y(t>  be  given  by 

y(t)  -  T0u(t),  (() 

where  u(t)  is  the  unit  step  function  and  yQ  is  a  constant.  The  Laplace  transform  of  y(t) 
is,  under  these  conditions,  given  by  (l/s)Y0,  and  Eq.  (5)  then  becomes 

X(s)  -  (l/lstmtltvys).  (7) 

Partial  fraction  expansion  and  retranaformatlon  leads  immediately  to  the  solution 

x(t)  -  (Y0/m)(l  -exp(-mt)luU).  (8) 

So  long  as  a  satisfies  the  condition 

Re(m)  >  0,  (9) 

x ( t )  converges  for  large  t  to  the  desired  solution  x  »  1Q.  However,  if  Re(m)<0,  x(t) 
continues  to  grow  and  equilibrium  is  never  reached. 


It  is  useful  to  us  later  if  we  replace  the  integrator  in  the  system  of  Figure  2  with  a 
simple  RC  filter.  This  is  effected  analytically  if  the  factor  1/s  in  Eq.  (5)  is  replaced  by 
the  factor  l/(s+b),  where  1/b  is  the  time  constant  of  the  low-pass  filter.  Assuming  y(t)  - 
Y0u(t),  the  resultant  time-domain  equation  is 

x(t)  -  lY^'tn+bl)  (1  -  exp(-(m+b)  t)  )u(t).  (10) 

In  order  for  x(t)  to  approach  the  desired  solution  Yg/m  for  large  t  it  is  necessary  that 

Re(m*b)  •  b  ♦  Re(m)  >  0,  (11) 

and  b  must  satisfy  the  condition 

b  <  <  | m | .  (12) 

Going  back  to  the  matrix  case,  as  noted  above,  the  system  of  Figure  1  (or  its  generali¬ 
sation  for  problems  of  higher  dimensionality)  is  stable  only  if  matrix  n.  is 
positive-definite,  meaning  that  all  eigenvalues  of  M  have  positive  real  parts.  This  condi¬ 
tion  reduces  to  condition  (9)  for  the  non-vectorTal  case.  Fortunately,  the  covat lance 
matrix  associated  with  a  set  of  signal  waveforms  is  Indeed  positive-definite  and,  in  fact, 
heraltla.n,  implying  that  its  eigenvalues  are  both  real  and  positive  (see,  e.g.,  p.  249  of 
Ref.  (4)  ).  As  a  consequence,  non-osc History  convergence  of  the  relaxation  method  is  assur¬ 
ed,  at  least  in  theory,  for  the  problem  of  interest  to  us.  Were  H,  not  positive-definite,  it 
would  be  necessary  to  go  to  a  more  complicated  relaxation  processor,  as  discussed  in  (2). 

Temporal  Frequency  Carriers  for  Complex  Signal  Representation 

The  aystem  of  Figure  1  can  be  modified  to  operate  with  coaplox-valued  vectors  and  ma¬ 
trices  using  real  signal  representations.  For  Incoherent  electro-optic  lmplesentatlon, 
nonnegat ive-rea 1  representations  of  the  quantities  are  required!  with  varying  amounts  of 
computational  overhead  such  methods  as  three-component  and  biased  real-imaginary  encoding  of 
complex  numbers  can  b»  used  (see,  e.g.,  15)). 


Of  interest  to  us  in  this  note  is  a  nonnegative-real  reprasentatlon  based  on  temporal 
frequency  carriers.  We  introduce  the  idea  by  again  using  the  scalsr  case  of  Eq.  (4)  as  sn 
example,  let  coaplex  signals  s,  m,  and  y  be  given  by 
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«  “  |a|««p()(a) 

y  ■  ly|«*ptl*y» 

■  *  (■lespus  i. 


•a  ■  srg(s)  (14a) 

•y  -  «9ly)  (14b) 

•B  *  arg(a).  (14C) 

Thcsa  cosplsx  quantities  ara  represented  in  tha  ayatsm  of  Pigure  3  by  tha  tlae  waveforms 

|x(t)|cos(wQt  ♦  SR(t))  (15a) 

|y(t)|co»(«0t  ♦  4y(t)>  U5b) 

|a(i)|cos(«0t  ♦  §B(t))  (15c) 

Tha  bandpaaa  filtara  (BPP)  ara  aaauaad  to  hava  unity  qaln  and,  for  now,  linear  phaae  charac- 
taristica  over  the  bandwidth  of  concarn.  in  rlgura  3,  algnal  |x(t)|coft(*0t*t _ ( t ) ) 
repraaanta  an  aatlaata  of  tha  aolutton  of  Eq.  (4).  Tha  magnitude  |x|  and  phaaa  4  vary 
alowly,  in  accord  with  tha  bandwidth  of  tha  narrowband  filter  (Mar).  * 

It  can  he  ahown  that  the  wavafora  |a(t)|coa(w  t  a  a  ( t > )  aattlea  down  to  tha  dealred 
reault  ( |y |/|n|coa(u  t  ♦  4  -  4a)  ,  to  within  an  error  determined  by  aaplifler  gain  G  and  by 

a pacific  charact  rial lea  of  tha  hlgh-0  filter,  ao  long  aa  the  etabillty  condition 

Re(a)  •  ;  s|coa4a  >  0  (jj) 

la  aatlafied.  To  illuatrata  tha  baalc  idea  wa  preaant  the  following  graphical  example.  By 
Inspection,  the  negative  input  to  the  differential  aaplifler  la  given  by 
|*a|coa(w0t  ♦  4a  ♦  4a)  .  Tha  output  of  the  differential  aaplifler  la  thua 

Gl|ylcoeUot  ♦  4y)  -  |*a|coa(w0t  •»  4^  ♦  4^)).  (17) 

The  Input  and  output  of  the  differential  aapllftar  ara  ahown  in  phaaor  fora  in  Pigure  4, 
where  it  haa  been  aaauaed  that  C-l.  Tha  difference  algnal,  represented  by  phaaor 

ly|«*Plj«yl  -  | aa|exp(} (4^  ♦  4B)J,  (lt) 

is  applied  to  the  hlgh-0  filter.  Froa  Pigure  4(c)  it  la  clear  that  this  algnal  haa  argnl- 
tude  and  phaaa  appropriate  to  drive  the  output  of  the  hlgh-0  filter  in  the  direction  of  the 
solution  |y/a|exp  (J(4y  -  4^)1. 

The  signals  of  Figure  3,  although  real,  are  bipolar)  l.e. ,  they  are  not  nonnegative. 
However,  as  demonstrated  in  the  following  section,  bias  signals  can  be  added  where  necessary 
to  allow  electro-optical  lapleaentatlon.  The  bias  signals  era  easily  filtered  out  in  elec¬ 
tronic  aubaectlona  to  slapllfy  overall  systea  operation. 

In  the  following  discussions  wa  shall  astuae  that  the  hlgh-0  filter  of  Pigure  3  is 
characterized  by  a  single  coaplex  pole  pair.  (In  practice  the  filter  would  probably  be 
Implemented  by  means  of  a  pair  of  slaple  hC  filters  in  lnphase  (cosine)  and  quadrature 
(sine)  channels  of  a  baseband  equivalent  filter.  See  Sec.  S.1.3  of  Ref.  (4)).  Under  these 
circumstances.  In  terms  of  the  phaaor  representation,  sytea  operation  is  the  aaaa  as  that  of 
the  sytea  of  Pigure  2  with  the  integrators  replaced  by  RC  filters. 

Basic  *.  C.  Electro-Optic  Processor  Concsot 

, _ **  this  describe  the  basic  principles  of  ar.  a.c.  electro-optical  processor 

n,»  E4<**  th*  r,1***tlon  method.  The  key  to  systea  operation  la  an  SLR  that 

?n  “"*y  °*  "4‘h  the  (l,J)th  cell  being  characterised  by  a  tlae- 

-varylng  light  intensity  transmittance  of  the  fora 

T1J  “  *  4  l*lj!c0,(,'ot  ♦  »!!»•  (19) 

shA1"-"4!  ",  l*ljl**P*3*ljl  •  IUi  B  and  aatris  eleaent  aagnltude  |b<^|  are  noraaltsed 
O  tas  a  value  between  tero  and  unity.  In  order  to  assure  systea 


stability.  JJ.  Is  assumed  to  b#  positive-definite. 

Ovsrall  system  operation  is  shown  by  the  block  diagram  ef  ri;iir*  S.  Certain 
alut-flUtr  coablnations  have  basn  aada  unnecessary  in  tha  digaram  by  tha  use  of  complex 
representations  for  the  signals.  Zn  tha  diagram  b  and  £  represent  respectively  a  constant 
bias  vector  and  a  constant  bias  matrix.  Tne  quantltltes  x,  y,  and  intermediate  quantltites 
are  vectorial  In  nature,  as  indicated  by  the  bold  signal  flow  arrows.  The  output  of  the 
electro-optic  matrix-vector  multiplier  subsection  is 

l Ifi  ♦  hex  pt J«t] lb  ♦  xsxpljw  tj ) 


“(Mx)  explJ2*t)  a  (Mb  a  Bx)exp( j«&tl  a  Bb, 

only  the  first  term  of  which  is  passed  by  the  bandpass  filter.  The  optical  matrix-vector 
multiplier  Is  in  most  respects  like  the  system  described  by  Goodman  at  al.  IS].  However, 
as  indicated  above  and  discussed  below,  each  complex  matrix  element  mt .  is  conveyed  on  a 
temporal  frequency  carrier,  as  in  Eq.  (19).  * 

The  2-D  SLM  representing  K.  is  made  up  of  a  2-0  matrix  of  individual  resonant 
plexoelectro-optlc  light  modulators  It],  A  resonant  plexoelectro-optlc  modulator  la  simi¬ 
lar  in  many  respects  to  an  ordinary  linear  electro-optic  modulator,  but  it  is  operated  at 
the  plesoelectrlc  resonance  frequency  of  the  crystal.  At  this  frequency,  shear  and  thick¬ 
ness  mode  vibrations  of  the  crystal  result  In  large  sinusoidal  fluctuations  of  the  birefrin¬ 
gence  of  the  crystal  with  the  application  of  relatively  small  a.c.  voltages.  If  placed 
between  crossed  polarizers  and  driven  at  the  resonance  frequency  »0  ,  the  modulator  produces 
fluctuations  in  light  intensity  transmittance  T  In  accord  with  the  equation  (61 

T  *  sln2((l/2)«cosuot],  (21) 

where  *  ,  the  peak  blrefringent  retardation,  is  directly  proportional  to  the  amplitude  of 
the  applied  sinusoidal  voltage. 

Two  characteristics  of  the  resonant  piesoelactro-optlc  effect  are  of  special  interest 
to  us.  rirst.  It  is  an  a.c.  effect  suitable  for  complex  processing.  Depending  on  the  cry¬ 
stalline  material  used,  resonant  frequencies  in  ths  range  O.S-500  KHz  appear  to  be  easily 
obtained  (6,7).  Of  equal  Importance,  because  of  the  mechanical  resonance,  voltages  required 
to  achieve  significant  changes  In  transmittance  T  are  typically  several  orders  of  magnitude 
lower  than  those  required  for  the  d.c.  linear  electro-optic  (Fockels)  effect.  Specifically, 
a.c.  voltages  in  the  5-20  volt  range  are  adequate  in  reported  cases  for  high  efficiency 
modulation  16,7]. 

For  optical  processing  purposes  we  want  device  transmittance  T  to  vary  linearly  with 
the  applied  signal.  If  the  crystal  used  has  some  natural  birefringence  or  is  illuminated  by 
light  of  the  appropriate  elliptical  polarization,  tho  operating  point  of  ths  device  can  be 
shifted  such  that  T  has  the  form 

T  -  sin2  [(•/«>  ♦  (S/2ICOS  *  t].  (22) 

Theoretical  considerations  show  that  for  an  applied  driving  a.c.  voltage  V  at  an  arbitrary 
frequency  w  ,  device  response  can  be  written  as 


T  •  sin2 ((•/«)  ♦  »<v/ve)cos  wt]. 


where  VQ  la  a  constant  that  appears  typically  to  be  in  the  1-20  volt  range,  and  *  is  a 
resonance  parameter  given  (in  the  vicinity  of  resonance-far  off  resonaece  T  is  essentially 
constant)  by  the  standard  resonance  function 

•  “  •(-]  ■  - - .  (24) 

|1  a  «0J  (PJ)V'2 

o 

Device  Q,  unless  deliberately  reduced,  is  typically  at  least  10J,  and  tbs  tesonance  Is  thus 
extremely  sharp. 

So  long  as  the  applied  voltage  V  is  kept  sufficiently  small,  T  can  be  written  in  the 


*  “  1/2  ♦  Mw)(V/V0)coe  wt. 


This  expression  does  not  take  into  account  any  phase  shifts  Introduced  by  the  resonance 
as  w  changes  from  »0  ,  a  point  addressed  later. 


The  hlgh-0  resonance  just  described  can  be  exploited  In  estimating  the  covariance  u- 
trls.  Adopting  the  notation  of  Rat.  (4),  the  elements  a1j  of  H.  are  given  by 

B11  "  (20 

vhara  a(<t)  la  tba  1th  complex  signal  antarlng  Into  tha  eovarlanca  matrix  calculation,  and 
<.>  danotaa  expected  valua.  In  a  digital  algnal  procaaalng  ay a tarn  £  la  aatlmatad  by  numerl- 
cally  calculating  tha  uniformly  valghtad  avaragaa 


for  tha  dlffcrant  algnal  palra.  Tha  integration  time  T  la  determined  by  overall  ayatem 
operating  paramatara.  Tha  reaonant  filter  approach  allova  tha  calculation  of  an 
exponentially  weighted  average  of  tha  form 


•li 


t 

(1/T)  / 


XjlilaJltlanpllt-tiAl  dt. 


(24) 


which,  for  the  hlgh-Q  filter  caae,  approxlmatea  (27).  figure  ((a)  illuatrataa  eatlmator 
operation  schamatlcally  for  a  alngla  pair  of  complex  algnala,  ahown  aa  coming  from  two  an¬ 
tennas)  Pigura  4(b)  shown*  tha  same  operation  in  complex  algnal  form.  In  tha  ln'.ter,  the 
resonant  filter  la  represented  by  the  tranafer  function  associated  with  ltf  baseband  equiva¬ 
lent,  which  is  a  simple  RC  filter.  The  output  of  the  filter  is  slowly  time-varying,  with 
approximate  bandwidth  1/T. 

It  should  be  emphasised  that  the  hlgh-0  filter  of  figure  4(a)  is  incorporated 
effectively  in  the  electro-optic  device  itself  and  that  its  output  la  not  an  electrical 
signal  but  rather  the  modulated  light  transmittance  T  of  Eq.  (25).  figure  7  models  the 
essentiaJ  input-output  characteristic*  in  term*  of  a  resonant  tuned  circuit  and  the 
electro-optic  transfer  characteristics.  The  input  signal  is  the  product  of  the  two  IP 
signals  of  figure  4i 

•in**’  “  l«l(H|cosI(e1  ♦  a&)t  ♦  arg(*1(t).' )  Is^l t ) |cos l»jt  ♦  arglsjtt))).  (29) 

The  filter  Is  tuned  to  the  difference  frequency  «0  .  Estimation  of  the  entire  covariance 
matrix  M  requires  that  all  signal  pair  products  be  calculated  and  applied  to  separate  elec¬ 
tro-optic  elements  In  the  matrix  SLM. 

Assuming  the  small  signal  approximation  leading  to  Eq.  (25)  is  satisfied,  the  trans¬ 
mittance  of  the  ijth.  cell  of  the  resonant  SUt  Is  given  bv  Eq.  (19),  where  m.,  is  given  by 
Eq.  (28).  The  transmittance  function  evolves  with  a  bandwidth  of  1/T,  whereai  the  relaxa¬ 
tion  algorithm  converges  at  a  rate  governed  by  the  smallest  eigenvalue  of  n,  as  noted  ear¬ 
lier.  Stability  and  convergence  can  be  Improved  If  Cram-Schmidt  orthogonallxatlon 
processing  is  performed  on  the  array  element  signals  (111,  Sec.  E.3). 

We  noted  in  conjunction  with  Eq.  (25)  that  a  phaee  term  was  ignored  in  the 
transmittance  response  T  of  the  resonant  electro-optic  modulator  to  a  sinusoidal  excitation 
at  frequency  v  *  »0  .  The  actual  phase  shift  ie  given  by 

•  lw)  -  -tan'1(2Q(^2)J,  ,30j 
o 

consistent  with  the  simple  resonance  characteristics  of  the  device.  If  we  wanted  a  propor¬ 
tional  relationship  between  the  signal  applied  to  an  SUt  cell  and  the  transmittance 
response,  thie  phase  shift  could  hurt  us.  Recall,  however,  that  the  reaonant  cell  le  being 
“*•”  to  b*  lt*  hl9h”9  filter  charecterlatlcs,  the  covariance  matrix  N.  The  phase 
of  Eq.  (30)  le  a  natural  pert  of  that  filtering  operation  and  Is  fully  consistent  with  the 
desired  result. 


ptscueelon  of  Some  Optcntlal  Problem  Areas  and  System  Modifications 


In  this  section  we  briefly  address  three  topics  related  to  practical  Implementation  of 
the  scheme i  effects  of  harmonica  generated  by  the  resonant  SUM  algnal-to-blss 
considerations)  and  a  hard-llmlter  modification  for  covariance  matrix  estimation. 


result's1**  hi,h,r"ord*r  included  In  the  approximation  of  Eq.  (25)  for  T,  the 


T  •  1/2  ♦  l(.)(V,V#)co«  at  -  (l/3)8,(a)(V/V#)JC0S,et. 


(21) 


The  third  tin  of  th*  approximation  contains  frequency  components  at  «  and  3«  .  Tha  tan 
at  a  ha*  amplitude  (1/4)  3»J(«.)  (V/V0) ■*,  1*  cubic  In  tha  signal  voltage  V.  Sine*  thla  tan 
la  at  the  a  tea  frequency  a*  tha  daalrad  transmission  modulation.  It  will  hava  a  dlract 
affect  on  tha  output  of  tha  vactor-aatrix  multiplication  operation.  The  aavarity  of  thla 
undaairad  ter:,  auat  ba  analyaad  In  order  to  datanlna  what  the  optlnua  aodulatlor.  level  la. 

Tha  blaa  b  In  tha  system  diagram  of  Figure  S  la  moat  aaally  set  equal  to  a  constant, 
sufficiently  largo  to  keep  all  vector  component*  non-negative.  However,  an  laprovaaant  In 
ayataa  algnal-to-biaa  ratio  and,  hanca.  In  avataa  dynamic  rang*  can  ba  achieved  by  allowing 
b  to  vary  aa  the  magnitude  of  the  Input  vector.  In  term*  of  the  problem  of  lntervat,  where 
Thi  Input  to  the  vector-matrix  multiplier  la  tha  current  eatlmate  of  the  vector  x,  each 
component  of  _b  would  bo  given  by  ~ 

btt  -  |X|(t||.  (12) 

To  clarify  the  point,  w*  conalder  the  acalar  problem.  The  vector-matrix  multiplication 
than  reduce*  to 

(b(t)  ♦  |x(t)|coa(»0t  ♦  *,(0)1  la  ♦  |m|coa(w0t  ♦  *m  ))).  (3JJ 

Although  b(t)  could  be  act  equal  to  the  conatant  value  max{|x(t)|).  It  la  clear  that  dynamic 
rang*  la  maximized  If  b(t)  -  |x(t)|.  for  the  blaa  la  then  Just  sufficient  to  preserve  non¬ 
negativity.  So  long  a*  the  carrier  frequency  ■  la  much  larger  than  the  effective  bandwidth 
of  |x(t)|.  the  resultant  terma  proportional  tol»(t>  can  be  filtered  out  subsequently  In  the 
electronic  part  of  the  processor.  Not*  that  matrix  bias  B  la  fixed  by  the  SLM 
characteristics  and  cannot  be  Improved. 


A  potential  difficulty  with  the  processor  aa  proposed  la  the  need  to  calculate  the 
product  of  Eq.  (29).  In  the  fora  written  thla  calculation  requires  the  ut*  of  a 
four-quadrant  multiplier,  since  both  Input*  are  modulated  In  aaplitud*.  However, 
four-quadrant  multipliers  ars  unavailable  at  the  IP  bandwidth*  generally  of  Interest,  and 
this  computation  can  therefore  present  aerloua  difficulties.  A  way  around  the  problem  In¬ 
volves  hard  limiting  on*  or  both  of  the  signals  entering  Into  Eq.  (29)i  l.a.,  stripping  off 
all  amplitude  modulation.  Under  thaa*  circumstances,  simple  balanced  mixers  can  be  used. 
Hard  limiting  Is  used  in  connection  with  the  Howells-Applebaua  method  for  phased  array 
signal  processing  to  reduce  the  dependence  of  array  performance  on  th*  strength  of  the 
external  noise  field  (Ref.  4,  sec.  S.1.5).  The  effects  of  hard  limiting  with  the  processor 
architecture  proposed  here  must  be  studied  further. 


Concluding  Remarks 

In  this  not*  we  have  presented  an  slectro-optlc  scheme  for  Implicitly  Inverting 
covariance  matrix  estimates  that  keeps  complex  modulation  information  (magnitude  and  phase) 
on  biased  sinusoidal  carriers.  Th*  temporal  carrier  method  eliminates  th*  need  for  the 
repetitive  overhead  computations  that  are  required  with  other  nonnegatlve-real 
representations.  There  appear  to  be  no  fundamental  reasons  the  scheme  should  not  work. 
Perhipi  th*  most  Important  question  to  be  addressed  at  this  tine  relates  to  th*  dynamic 
range  that  could  be  expected  of  such  a  processor.  Th*  temporal  carrier  ache**  Is  attractive 
for  Its  elegance.  Whether  it  represents  a  truly  significant  Improvement  over  schemes  based 
on  three-component  or  biased  two-component  representations  of  complex  signals  Is  a  question 
that  cannot  be  answered  without  further  study.  ’ 
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fig.  1  Syaaam  (of  aofving  2x2  matrix  invanion 
relaxation  method. 


Fig,  2  Conditionally  Habit  tyvtam  far  aohing  non-wctorial 
aquation  y  -  mx  for  x  given  y  and  m. 
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A  port  lolly  coherent  Kcchlsr-llluainatlon 
lneging  eyetoo  equipped  ulth  coopleoentery  nooks 
in  source  end  pupil  planes  coo  bo  used  to  perform 
lnoge  enhencenent  operations  ouch  as  directional 
or  non-d  tract  lone  1  edge  enhencenent  and  eapheele 
of  opotlal  bandpass  features .  With  aany  objects 
the  use  of  coaplcnentary  nooks  results  in  hlgh- 
contrsst  lnages.  Underlying  principles  srs 
explained  and  prellalnary  experinental  results 
presented. 


Introduction 

Conventional  Inaging  systcas.  particularly 
microscopes,  are  often  aodifled  In  one  way  or 
another  to  Improve  their  ability  to  lnaga  specific 
Claeses  of  objects.  Two  canopies  are  Zerolke 
phase  cootr**t  microscopy  and  dark  field  micro¬ 
scopy.  Host  aodlf lcatlou  schemes  are  based 
on  coherent  optical  laaglng  principles.1  and 
lllualnatton  Is  usually  provided  by  a  polnt-llke 
source  or  Its  equivalent.  In  this  paper  a  scheme 
la  described  that  has  much  in  coaaon  with 
spatially  Incoherent  Imaging2*3  and  chat  offers 
advantages  characteristic  of  both  coherent  and 
Incoherent  lnsglng:  high  contrast  for  the  output 
yet  the  absencs  of  coherent  artifacts  such  as  arc 
Introduced  by  dust  on  lenses-  The  stheae  Is 
Illustrated  by  way  of  easapla  In  Che  following 
section,  then  certain  general  aspects  are 
discussed  and  additional  czaaples  presented. 

An  Fxaaola  -  Bandpass  lasting 

figure  1  shows  an  laaglng  systea  of  the 
Koehler  lllualnatlon  type,  where  the  source  Is 
laaged  into  the  pupil.  The  source  is  assumed  to 
be  spatially  Incoherent  and  uniform  In  Intensity. 


Its  laags  overfills  the  aperture  by  a  sufficiently 
largo  aargln  to  sasurs  that  tbs  laaglng  operation 
la  linaar  In  wave  Intensity.*  If  a  aask  that 
contains  two  boritootally-tpacsd  pinholes  la 
placed  in  the  pupil  plane ,  the  laaglng  operation 
is  characterized  by  the  OTf  shown  la  Fig.  2. 
(andpass  structure  la  emphasised  la  the  Imago.  At 
the  saae  tine,  low  spatial  frequency  structure  and 
bias  Is  also  transmitted. 


OTf  CROSS-SECTION 


fig.  1.  larging  systea  with  Koehler-type 
lllualnatlon. 
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OTF  associated  with  two-pinhole 
pupil  mask. 


Hie  systea  Is  now  modified  by  placing  In  the 
source  plane  a  aask  that  la  transparent  except  for 
tvo  opaque  dots,  which  are  perfectly  laaged  onto 
the  two  pinholes  In  the  pupil  plane.  If  the 
object  Is  non-dlffuse,  the  effect  on  the  Image  Is 
draaatlci  bias  and  some  low  spatial  frequency 
structure  is  strongly  reduced,  while  the  bandpass 
structure  rsaslns  largely  unchanged,  figure  3 
shows  the  laags  of  a  non-dlffuse  test  target,  first 
with  conventional  incoherent  laaglng,  then  with 
the  two-plnhole  aask  In  the  pupil  plans,  and 
finally  with  both  the  pupil  plane  aask  and  the 
complementary  source  plane  aask  present.  White 
light  lllualnatlon  was  used. 

The  reason  for  the  dramatic  change  in  laags 
appearance  is  explained  by  fig.  A,  which 
Illustrates  for  the  specific  cese  where  ths  object 
consists  of  a  transparent  Mineral  "A"  In  a  uniform 
aedliia-transmlttence  background.  Light  froa  a 
particular  point  on  tbs  source  passes  through  the 
object  and  Is  focused  onto  ths  pupil  plans.  Ths 
geoastry  of  the  systea  Is  such  that  ths  Fraunhofer 
pattsrn  of  ths  object  Is  projected  octo  ths 
pinhole  aask.  The  flguse  shows  ths  Fraunhofer 
pattsrn  occupying  one  particular  position  relative 
to  ths  pinholes. 
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Image  of  non-diffuse  test  target:  (a)  conventional  incoherent  image;  (b) 
image  obtained  with  two-pinhole  mask  in  pupil  plane;  (c)  image  obtained  with 
two-pinhole  pupil  plane  mask  and  complementary  source  plane  mask. 

J  prevented  free  passing  through  to  the  image  plane. 

W  y  The  result  is  an  image  of  inc reseed  contraat  and 

I  //*  (because  of  the  finite  diameter  of  the  pinholes) 

I  reduced  low  spatial  frequency  content.  The 

^ _  removal  of  the  undiffracted  light  aakea  the  acheae 

*a— similar  to  dark  ground  microscopy  or  schlleren 

^i|  imaging.  However,  system  operation  is  euch  more 

jtfr  IJ  complicated  because  of  the  extended  source  and 

q  /  o  f  tvc-pinhole  mask. 

Because  the  system  la  now  partially  coherent. 
Fig.  4.  Diffraction  pattern  from  numeral  it  la  not  possible  to  describe  the  imaging 

*4*  incident  on  two-pinhole  mask,  operation  in  terms  of  a  transfer  function:  the 

as  produced  by  Off-axis  source  imaging  la  linear  neither  in  wave  amplitude  nor  in 

point.  wave  intensity.  Nevertheless,  qualitatively.  It 

la  not  unreasonable  to  think  of  the  resultant 

Each  and  every  point  on  the  spatially  image  as  being  a  bandpass  filtered  version  of  the 

incoherent  extended  source  projects  a  object  intensity  transmittance  modified  by  a 

corresponding  Fravnhofer  pattern  on  the  mask.  If  reduction  lw  bias, 

the  source  la  perfectly  uniform,  these  Fraunhofer 

patterns  occupy  all  possible  positions  relative  to  ,  Some  Central  Character 1st  lea  of  the  Scheme 

the  pinholes.  For  each  position,  the  light  is 

"sampled**  by  the  pinholes  and  a  modulated  fringe  If  the  objective  of  the  scheme  la  to  perform 

pattern  la  produced  in  the  image  plane.  The  spatial  fllterlng-like  operations  on  the  object 

superposition  of  theae  fringe  patterns  la  the  intensity  transnlttance  while  at  the  same  time 

actual  image  observed.  (The  fringe  patterns  add  Improving  the  contrast  of  the  image,  then  it  la 

on  an  Intensity  basis  because  of  the  spatial  desirable  that  the  pupil  plana  mask  (which 

incoherence  of  the  source.)  determines  the  "filtering**  characteristics  of  the 

system)  and  tha  source  plane  mask  be  largely 

Now  assume  that  tha  source  plane  mask  la  complementary  to  each  other:  where  one  has  a  high 

present— a  mask  that  la  complementary  to  tha  pupil  transmittance,  the  other  should  be  nearly  (or 

plana  mask.  In  this  case,  the  Fraunhofer  patterns  totally)  opaque,  and  vice  versa.  This  condition 

occupy  all  possible  positions  in  the  pupil  plane  assure*  that  light  that  la  not  diffracted  by  tha 

except  those  two  positions  that  would  place  the  object  peasea  through  to  tha  image  plana  only  with 

seroth  diffraction  order  (dc  spot)  associated  with  vary  low  amplitude.  It  would  appear  that  beat 

toe  pattern  directly  on  top  of  a  pinhole.  In  most  contrast  la  obtainable  wtth  binary  masks,  i.e., 

regards  the  resultant  image  is  tha  same  as  before.  masks  that  are  either  (ally  transparent  or  fully 

Nowever,  for  a  relatively  non-dlffuee  object,  much  opaque.  However,  binary  masks  may  not  be  the  beat 

of  tha  light  tnat  produces  bias  in  tha  image  la  choice  for  certain  claaaas  of  objects. 
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If  blamrp  M)ki  tn  uud  la 

(ourca  m4  pupil  plana*.  It  1*  daalrabl*  Chat  th* 
fractional  araa  of  tha  pupil  aaak  occupied  by 
traaa parent  raplvn*  h*  relatively  aaall.  ocharwlau 
the  object  Praunhofer  pattern*  Incident  on  tha 
pupil  aaak  Bay  not  be  able  to  occupy  a 
aatlafactorlly  large  n unbar  of  poeltlona  relative 
to  the  aaak  atructur*. 

A  general  aatheaatlcal  analyala  of  the  acheae 
la  not  particularly  illualnatlng  becauae  of  the 
fuodaaental  nonlinear  nature  of  the  partially 
coherenc  iaaglng  operation:  the  reaultanc 
Integral  expreaalona  are  coaplicatcd  and 
uninformative.  Furthcraore,  the  conaidaration  of 
apeclflc,  analytically  tractable  exaaple*  la  rlaky 
In  that  conclualon*  aay  b*  au-geated  chat  are  not 
valid  In  general.  Monethcleaa ,  aoae  uaderatand'ng 
of  potential  llaladon*  of  the  echene  can  be  Bade 
by  aaaualng  different  apeclflc  charactcrlatlca  for 
the  coaplex  wav*  aapllcude  tranaaittance  of  the 
object. 

One  caae  where  a  general  conclualon  appeara 
poaalble  1*  that  of  the  highly  diffuse  object, 
e.g..  where  a  phototransparency  la  placed  In 
contact  with  a  diffuser.  In  this  case  the  object 
Fraunhofer  pattern  la  spread  out  In  the  pupil 
plane,  there  Is  no  predominant  dc  spot,  and  there 
1*  thu*  no  algnlf leant  laproveaent  In  laage 
contrast.  So  long  as  the  source  distribution  1* 
not  drastically  aodifled.  little  change  in  the 
appearance  of  the  laage  distribution  froa  that 
obtained  with  a  totally  unifora  source  1*  Co  be 
expected. 

Other  Exaaple* 

Several  different  pupil  and  source  aask 
combinations  that  are  more-or-l.sa  general  purpose 
In  nature  are  suggested  In  Fig.  5.  The  coablnatlon 
In  Fig.  S(a)  would  be  suitable  lor  enhancing 
vertical  edge  structure,  whereas  that  in  Fig.  5(b) 
la  appropriate  for  non-dlrectlonal  edge  enhanceaent 
(essentially  hlgh-psas  filtering),  hot*  that  In 
both  caeca  the  results  can  be  controlled  somewhat 
by  changing  the  relative  disaster*  of  the  pupil 
opening  and  the  complementary  source  obstruction. 
The  non -directional  enhanceaent  of  bandpass 
'tructurea  can  be  achieved  with  the  masks  of  Fig. 
5(c). 

PUPIL  OPENING 

SOURCE  IMAGE  — ^ 

<•) 


Fig.  5.  Source  end  pupil  aaak  configuration*  for 
(a)  vertical  edge  enhanceaent  and  (b) 
aon-d fractional  adga  enhanceaent. 


Figure*  4  and  7  oa  the  next  page  show  the 
reeulte  of  aoae  elaple  experlaente  with  complement 
tary  binary  source  and  pupil  aaaka.  Figure  6(a) 
la  a  reference  laage,  obtained  with  oo  aaaka  in 
the  eyetea.  Figures  6(b)  end  6(c)  were  obtained 
with  a  double-el  It  musk  In  tha  pupil  plane, 
tig-  6(b)  without  a  complementary  source  plane 
aaaka ,  Fig.  6(c)  with.  Figure  7  ehows  the  reaulta 
with  a  thin  annular  aperture  in  the  pupil  plane. 
Fig.  7(a)  with  fully  incoherent  Illumination,  Fig. 
7(b)  with  e  coapleaentary  source  plane  aaak  la 
placa. 

Concluding  Keaarka 

Aa  noted  above,  a  general  analyala  of  the 
acheae  le  not  particularly  enllghteoing.  Integral 
equations  can  be  written  that  dascrlbe  system 
operation,  but  they  provide  no  particular  insight 
Into  potsntlally  useful  configurations .  Systea 
performance  depends  strongly  on  the  apeclflc  class 
of  object  being  laaged— whether  It  Is  diffuse  or 
noo-diffuse,  of  high  or  low  contrast,  end  so 
forth.  It  Is  worth  noting  that  this  situation  la 
not  significantly  different  froa  that  found  In 
alcroacopy.  where  specific  techniques  (e.g.,  phase 
contrast  alcroacopy)  are  applied  beneficially  only 
to  certain  typaa  of  objecta.  If  the  techniques 
discussed  In  this  paper  ere  to  find  significant 
application,  an  extensive  experimental 
Investigation  la  required.  It  may  be  that  for 
certain  classes  of  objecta — specific  typea  of 
cytological  specimens,  for  exaaple— this  acheae 
will  provide  useful  feature  enhanceaent. 

More  extensive  Invest 1 gat Iona  are  in  progress 
to  taat  tha  effects  of  different  source  and  pupil 
aaak  combinations  on  different  kinds  of  laagary. 

Of  particular  Interest  are  coapleaentary  binary 
Basks  where  the  pupil  aaaka  contain  numerous 
pinholes  positioned  with  certain  spatial 
autocorrelation  charactcrlatlca.  He  believe  that 
thla  approach  will  allow  for  the  synthesis  of  a 
wide  ranga  of  "filtering"  operations  and  still 
allow  significant  Improvement  in  laage  elgnal-to- 
blas  ratio  with  low  contrast  Inputs. 

Thla  work  was  supported  In  part  by  the  D.S. 
Army  Research  Office  under  Joint  Services 
Electronics  Frogrea  Contract  DAAG29-81-K-0024 . 

*  K.  Koizumi  la  with  Vitaehl,  Ltd.,  Frod action 
Engineering  Research  Laboratory,  292  Toahlda-Cho, 
Toteuka-Ku,  Yokohama  246,  Japan.  Be  wee  e 
visiting  researcher  at  Georgia  Institute  of 
Technology  when  thin  work  wee  done. 
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6.  Bandpaaa  "flltarlns"  with  double-allt  waak:  (a)  conventional  apatlally  Incoherent  Image;  (b) 
Incoherent  lnaga  with  double-allt  pupil  aaek;  (c)  partially  coherent  Inage  with  complementary 
aource  week. 
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Fig.  7.  Lad  la  1  biotfpMi  "filtering"  with  annular  aperture  In  pupil  plane:  (a)  Incoherent  inage  obtained 
vltfc  pupil  plane  naak  alone;  (b)  partially  coherent  Image  with  complementary  maak  la  aource  plane. 
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IjktU  tin-  i>4  fpece-Integratloa  Method  for  Coaputar  Holography 
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A  conceptually  staple  aethod  far  produclag  a  hologram  of  a  coaputer-apeclf lad  object  la 
to  lllualnate  the  phocoeeaeltlve  recording  plate  alth  a  reference  eave  and  trace  out  tha 
ohjact  dletrlbutlon  with  a  point  of  light  aavod  bp  a  coaputer-cootrollad  ecannlog  spates. 

The  principal  drawback  la  the  poor  elgnal-to-nolea  ratio  (Sfl)  of  the  reconatructed  object, 
hacause  of  blaa  buildup  la  the  recordlcg  proceea.  Two  aathoda  for  Inprowing  reconstruction 
sal  are  discussed  In  this  paper:  (1)  aaalalalng  the  contrast  of  each  fringe  or  sone-plate 
pattern  exposing  the  holograa,  and  < 2 )  recording  lnteraedlate  holograas  of  portions  of  the 
object,  which  are  then  reconstructed  for  use  In  recording  a  final  holograa  of  higher  slgnal- 
to-blas  ratio. 

Introduction 

One  aethod  for  coaputer  holograa  generation  la  Illustrated  In  fig.  1.  A  fined  point- 
source  reference  contlououslp  lllunlnates  tha  holographic  plate  while  a  second,  autuallp 
coherent  point  source  scans  out  tha  desired  object  distribution  under  coaputar  control,  for 
each  and  awerp  point'  on  the- "object ,"  the  photographic  eaulsloa  is  Ulualnsted  bp  a  sene 
plate  of  specific  center  coordinates  and  focal  distance,  appropriate  to  reproduce  that  point 
upon  reconstruction.  The  final  holograa  constats  of  the  tlac-lategrated  sun  of  all 
contributing  tone  plates.  Production  of  such  a  holograa  was  first  reported  In  11(1  bp 
Caulfield,  Liu,  and  Harris  ill. 
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figure  1.  Tlae-integratlon  aethod  of  recording  "object''  described  bp  scanning  point. 

The  llaltatlona  of  such  a  holograa  are  well  known,  beceuse  each  sons  plate  exposure 
cerrlee  with  It  Its  own  blaa,  the  contrast  of  the  holograa  la  extrsaelp  low  for  anp 
reasonable  nuaber  of  object  points,  the  diffraction  effielancp  of  the  holograa  Is  low  and, 
of  aore  serious  consequence,  the  reconstructed  object  asp  be  dla  coapared  to  the  light 
scattered  bp  ftla  grain  and  other  seatterera  In  tha  optical  spstea.  As  a  result,  the 
slgnal-to-oolee  ratio  (SHI)  of  tha  reconstruction  will  be  low. 

Two  nethode  of  taprowlog  tha  8*1  of  the  final  reconstruction  are  discussed  In  this 
paper.  One  aethod  optlalsea  each  contributing  rone  plats  to  assure  optlaua  owerall 
diffraction  efftclenep.  The  other  aethod,  aore  coeplleeted  than  tho  first,  uses  s  two-  or 
aultl-stap  recording  proceea  to  further  laprowe  the  slgnal-to-blaa  ratio  and,  theranp,  the 
reconstruction  111  of  the  final  holograa.  The  second  aethod  la  coapleaentarp  to  a  aethod 
described  bp  Caulfield  elsewhere  In  this  proceedings.  In  the  sense  that  Caulfield's  scheae 
works  best  for  object  plane  holograas,  whereas  this  scheae  works  best  for  fourlcr  tranefora 
holograas . 
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AiilTIli  of  tho  gxolc  f tobloo 

(•for#  dleeueelng  tho  too  oothodo,  lot  uo  ho  ooro  that  tho  nature  of  tho  hole  prohloa 
t,  Pot  coavealeace  oo  opoelolloo  to  o  Fourier  tronoforo  hologre.  foroot,  oo 

oufgottod  hr  rig.  2.  Tho  ooao  p'otoo  of  tho  gooorol  Froonol  hologroo  cooo  oro  aow 
olnuooldol  triage  pattorat.  If  tho  frlago  ptttora  ooooclotod  olth  only  o  ologlo  object 
polot  lo  recorded,  the  contract  of  the  raeultaot  expoeuro  con  ha  ^ulte  high.  If  N  ouch 
triage  petterne  era  euperpoeed,  however,  each  cerrrlng  tte  ewa  hlee,  the  cootroet  or  elgoel- 
to-bloe  ratio  of  the  total  expoeare  pattern  will  he  low.  If  the  H-expoeure  halograa  le 
rocooetructed,  there  will  ha  a  bright  epot  on  axle  and  M  din  recoaetructed  object  potato  lo 
a  background  of  acattarad  light  (nolao).  The  nolee  level  la  eetabllehed  bp  the  average 
tranenlttance  level  of  tho  hologroo;  In  tha  low  contract  caee  of  concern  It  le  le 
eaxentlalljr  Independent  of  tha  elgnal  level.  Thua,  the  SMk  of  the  final  reconetructloo  lx 
proportional  to  tha  algnal-to-blae  ratio  of  tho  hologrea  recording.  In  analogy  with 
Incoherent  holography,  thle  decreaaee  roughly  an  (I/M)  ,  where  M  le  the  nunher  of  object 
polnce  [21. 
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figure  2o  Tvo-itep  process  for  improving  SNR  of  t Ime-lntegrat loo 
(Incoherent)  holography.  Multiple  Intermediate  holograms  are  made  of 
portions  of  tha  object  and  re-reeorded  with  reduced  bias  on  a  final 
hologram. 
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Method  I:  Optimise  fringe  Vlslbll i ti 


r  improving  reconstruction  SNR  la  to  aesura  that  each  fringe 
eech  objeet  point,  carries  with  It  no  more  bias  th.  a  is  necessary, 
the  reference  weve  et  the  photographic  plete  remains  constant  in 
the  reference  wave  maintains  the  tame  amplitude  as  the  object  wave, 
11  contrast,  or  unit  visibility.  In  order  for  the  fringe  amplitude 
eeeery  that  the  object  and  reference  wave  have  magnitudes 
ra  root  of  the  desired  fringe  amplitude.  Equation*  (1)  end  (2) 

•  In  these  equations,  U  (x)  denotes  tha  complex  wave  amplitude 
object  point  exposure,  l”(x)  the  corresponding  intensity,  and  l(x) 
dlst rlbut loo .  For  the  constant  rsference  snplltuda  case  we  have 

£b(x)  -  »  +  Onexp;  Jw0xJ ,  <U 
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where  R  la  the  reference  wave  amplitude  and  0  (x)  (-  jo  |axp(j«  ])  tha  amplitude  of  tha  mvc 
fro*  tha  ntj*  object  point.  tha  term  within  parenthesee  in  tha  ?asc  aquation  represents  tha 
bias,  vhertti  tha  final  tern  rapraaanta  tha  diffracting  "signs!"  structure.  For  tha  aqual 
object  and  rafaranca  aaplttuda  caia,  tha  corraapondlng  diat rlbut loan  ara  aa  follows: 
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In  thin  cone  both  the  object  end  reference  woven  hove  aagaltudea  equol  to  the  nquere  root  of 
0  ,  with  the  phone  9„bnln*  corrled  by  one  of  the  woven.  Coaperlog  Rqa.  (Ic)  end  (2c)  we  nee 
thee  the  dlffroctlng  nlgnel  dint rlbut loon  ara  proportional  to  one  another.  However,  In  the 
equal  object  and  reference  a.plltudn  cane,  ntnee  each  contributing  fringe  pattern  ban  unit 
vlnlbllltjr,  the  bleu  contribution  to  tha  co.poelte  exponure  dlotrlbutloa  In  the  abnoluta 
ulnlauu  pocnlble. 

In  a  19b*  paper  by  Honan  and  Maaauy  (3),  It  In  nntl.ated  that  approxt.ately  70,000  unit 
vlaiblltty  f'lnge  pattern,  can  be  recorded  on  Kodak  type  6*9F  photographic  platan  to  achieve 
a  reconatructlon  SNR  of  1:1.  Por  a  SNR  of  30:1,  It  la  necennary  to  reduce  the  nueber  of 
fringe  patterna  (nod,  hence,  tha  nueber  of  object  point.)  by  the  nquare  root  of  30.  Thun, 
only  about  10,000  objoct  point.,  correapoodlng  to  a  100x100  array,  can  contribute  to  a  tiee- 
lategratlon  or  Incoherent  hologrnn  on  6*9F  platen  if  a  reconatructlon  SNR  of  30:1 
(approxlaataly  talevlnlon  quality)  la  dealred.  Thin  number  la  too  avail  for  virtually  all 
appllcatlona  of  lntcreat,  and  ooaathlng  tore  aunt  bo  done  to  lap.ov*  tha  oltuatlon. 

Method  2 :  Hultl-atep  Procedure 

The  eethod  we  propone  for  achieving  greater  ieprovaeanc  In  SNR  la  llluotratad  In  ri«. 

3.  A  uo-itcp  process  Is  shown.  The  key  steps  of  the  aethod  ere  ss  follows: 

1.  The  N  object  points  ere  divided  Ir.to  M  groups  of  N/M  points  esch  (the  points  any  be 

contiguous,  or  they  asy  be  chosen  to  satisfy  statistical  objectives), 

2.  tots ra«d lets  incoherent  ( t lae-lategret Ion)  Fourier  traosfora  hologrsas  ere  recorded  of 

these  groups  of  object  points,  on  the  ssae  poloc-by-point  baste  «e  before.  Theee  N 

Interaedlste  hologrsae  will  here  better  s lgnal-to-bles  rstloe  end,  hence,  better 
reconstruction  SNR's  then  would  bs  the  esse  were  ell  N  frlngs  pattern*  tntargrated  In  s 
slogls  hologram. 

3.  The  M  groups  of  object  points  ere  reconstructed  froa  the  H  Interaedlste  hologrsae  end 
their  Fourier  transforms  rerecorded  In  multiple  exposure  fashion  on  s  single  final 
holograa  piste.  The  reference  wave  for  each  of  ths  aultlpls  exposures  Is  opttalssd  for 
asxlaua  diffraction  efficiency. 

Since  the  SNR  of  eech  loteraedlete  holograa  reeonst ruct ton  Is  better  than  If  ell  N 
fringe  petterne  had  been  recorded  laltlelly  on  e  single  plete,  the  SNR  of  the  ftnel 
composite  holograa  is  also  Improved.  Row  aueh  laproveaent  can  be  achieved  le  analysed  In 
the  next  section.  The  beelc  process  is,  of  courss,  rsletivsly  cuabersoas.  Further,  unless 
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the  BMltlrli  uroaurat  producing  tha  final  hologram  ara  made  carsfully.  reciprocity  failure 
■ay  lead  to  unequal  reconstruction  efficiencies  for  the  contributing  hologram  distributions 
(4,5)*  (Thle  vlU  he  true  of  both  the  Intermediate  holograea  end  the  fleet  hologram*)  Ve 
note,  however,  that  the  me  of  thermoplastic  recording  materials  lm  place  of  film  may 
eliminate  both  problems. 

SHU  Analysis 

lm  a  preliminary  eeaeasment  of  possible  Improvement  In  SHE  of  the  final  reconstruction* 
we  go  through  a  simple  calculation.  To  simplify  the  analysis  as  much  as  possible  we  make 
the  following  assumptions! 

1.  The  recording  processing  Is  each  that*  over  the  exposure  range  of  concern*  the 
resultant  wave  amplitude  transmittance  of  the  hologram  equals  the  (normalised) 
exposure;  t.e«*  t(x)  •  E(x). 

2.  All  fringe  patterns  contributing  to  the  Intermediate  K  holograms  have  unit  visibility* 

3*  The  bias  amplitude  transmittance  for  both  Intermediate  and  final  Lolograms  Is  0.3* 

(This  Is  not  necessarily  optimum  for  exposures  of  this  kind*  but  la  a  reasonable 
starting  point  based  on  studies  for  conventional  holography  (6)*) 

4*  The  object  points  have  equal  intensity* 

Rone  of  these  conditions  la  essential  to  the  achievement  of  Improved  SNR*  but  the  enalysls 
Is  greatly  simplified  by  them* 

Let  L  be  the  number  of  object  points  contributing  to  each  of  the  M  Intermediate 
holograms*  L  Is  chosen  such  that  a  reasonable  SNR  can  be  achieved  on  reconstruction  of  each 
Intermediate  holopram*  The  wth  Intermediate  hologram  has  amplitude  transmittance  of  the 
form 
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where  n  (x)  account*  (or  th*  grata  aola*.  ThI*  holograa  la  placed  In  th*  Input  plan*  of  the 
coherent  apatlal  filtering  apace*  of  fig.  1(b),  which  attenuate*  tha  aero  order  and  block* 
all  other  light  outride  th*  nth  partial  object  region.  Th*  roeultant  war*  amplitude 
Incident  on  tha  final  holographic  plot*  1* 
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where  o'  (x)  represents  the  fraction  of  the  grain  nolan  specific  to  the  mth  partial  object 
region.  *The  xero  order  Is  ettenueted  such  thst 

ft  -  |l/«iC> , 

e  choice  thet  equalises  the  exposure  biee  contributions  produced  by  the  reference  end  object 
waves  (Ignoring  noise*  thle  choice  optimises  the  slgnel-to-bles  ratio  of  the  exposure^.  The 
resultant  Intensity  Is  of  the  form 

V*1  ■  •2****  •  * 

•  (VM.)  ♦  (V»t)<Er  l  ooof  w  *  ♦  8 1  ♦  |l/«7Ejn'<x)  ♦  O.T.  13) 
n-1  "  " 

♦  (Vd^V  (*)  ♦  O.T. 

In  this  equation,  0*T*  Includes  other  terns  (Including  the  conjugate  of  Che  noise  term)  that 
do  mot  contribute  to  the  final  composite  object  reconstruction. 
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Than  the  aua  of  tha  H  aaltlp'.a  arpoeurao  aa  the  final  holographic  plate  it  given  by 


“tot™  *  V 


By  isiuapctoa  I  above,  this  equals  the  amplitude  transalt taace  of  the  final  holograa;  i.e., 

I* 

u>  -  (1/J)  ♦  I  [a/a.)  i  coai«nt").  ♦  0BW  1  Cl 

■*1  n-1 


♦  n^Cxl  )  ♦  r^Cx)  ♦  O.T., 


vkera  •.(>)  la  tka  nataa  produced  by  flla  (rata  la  tha  flaal  holograa  aad  O.T.  agala  daaotaa 
taraa  that  racoaatruct  outaldo  tka  object  ragloa.  laapaetloa  of  lg>  (I)  ehowe  that  tkla 
flaal  holograa  producaa  aa  object  eoaalaclog  of 

a  •  in  •  uE 

potato,  each  of  which  la  racoaatruetad  alth  tha  aaaa  diffraction  affleloaey  aa  la  obtalaad 

with  tha  Interaedlate  holograaa.  Each  partial  objaet,  coaalaclag  of  l  potato,  la 

racoaatruetad  with  lta  owa  aaaoelatad  aolaa.  la  addlrloa,  thara  la  tha  overall  grata  aolaa 
coatrlbutloa  froa  tha  flaal  holograa,  Since  tha  two  aolaa  proeaaaaa  are  ladtpaadoat,  they 
add  oa  aa  lataaalty  baala,  aad  wo  expect  tha  aolaa  tataaalty  la  tha  flaal  racoaatruct loa  to 
ha  graatar  by  a  factor  of  two  coaparad  to  tha  lateraadlata  racoaa t rue t loaa ■ 

■y  way  of  llluatratlon,  aaauaa  that  l  •  10,000.  ha  aotad  earlier,  thta  corraapoada  to 
a  racoaatruct loa  SIR  of  10:1  alth  thOP  platea.  The  total  auabar  of  object  palate  that  caa 

be  racoaatruetad  by  tha  flaal  holograa  la  tbaa  Halted  to  about  I  - 

b/fc  «  10*, 

corraapoadlag  to  a  1000x1000  array  of  poluta,  a  reepaetable  auabar.  Tha  SMR  of  the  flaal 
recoaatrucclou  predicted  by  thla  alapllflad  aaalyala  would  be  21.1. 

Coacladlat  taaarka 

Although  tha  aadal  need  above  la  highly  alapllflad,  the  aata  coaclualoa  ehould  he 
generally  valid:  atgatf tcaacly  faproved  SNR  lx  obtainable  ualog  tha  two-atap  procaaa.  Tha 
following  additional  polata  apply: 

1.  Although  coaalderad  la  taraa  of  a  polnt-by-polnt  tlaa  latagratloa  to  produce 
leteraedlate  holograaa,  tha  aaaa  prlaclplaa  apply  to  atandard  tacoharaat  holograaa 
alao.  Tha  key  point:  record  only  parta  of  tha  object  at  a  tlaa  oa  lateraadlata 
holograaa. 

2.  Many  of  tha  prohleae  with  photographic  flla— grain  aolaa,  lack  of  raal-tlae  operation, 
reclptoclty  failure— are  ellalnaced  If  tharaoplaat tc  holographic  recording  aatarlal  la 
uaad.  Cther,  unexpected  probleaa  aay  arlta,  however. 

1.  The  aaalyala  above  la  apeclflc  to  tha  Pourlar  tranefora  holograa,  but  axtaada  to  the 
aore  general  3-D  object  cate, 

A.  Tha  apparent  auccaaa  of  tha  two-atap  aathod  auggaata  that  Incoherent  or  tlae- 

lategratlon  holograaa  of  even  larger  nowhere  of  object  potnta  aay  be  poeelblo  with, 
a.g.,  thrae-ataga  aathoda.  Thla  propoeel  la  eubject  to  atudy. 


3.  It  lUnU  ha  noted  that  th«  partial  recording  technique  prepoeed  allova  (or  aoaa 

reduction  la  tba  apatlal  frequeac?  bandwidth  required  of  tha  holographic  recording 
aadlaa,  aloco  tho  object-object  latarfaraaco  toraa  produce  oapoaura  dlatrlbutlona  of 
eaaller  than  ooraal  bandwidth. 

g.  Appllcatlone  extend  bepoad  dleplap  holograph?  to  tlae-lt  gratloo  laago  procaaatng  and 
other  atgnal  procaaalng  applications. 

Thla  work  waa  aupportad  bp  tha  B.S.  drop  kaaaarch  Ottlea  under  tha  Point  Saralcee 
Electronic!  Progtaa. 
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Attract 


A  r«v  generation  of  opto-eleet roalc  algnal  procaiaori,  aany  exploiting  acoua t o-opt lc 
technology*  has  bass  4«talopla|  during  tha  paat  ie*aral  years*  These  processors  are 
designed  to  perfocn  algebraic  operations  like  matrix-vector  and  matrix-matrix 
aul t Ipl lcat loo.  A  number  of  major  architectures  are  revlcvid,  Including  some  that  operate 
using  digital  arithmetic*  Fundamental  limitations  are  discussed 

1 •  Introduction 

The  acousto-optic  (AO)  Bragg  cell  la  regularly  used  now  to  perform  signal  convolution, 
correlation,  and  spectrum  analysis*  During  the  past  several  years,  a  new  class  of 
applications  based  on  such  algebraically-oriented  operations  as  matrix-vector  end  natrlx- 
metrlx  multiplication  has  been  developed  for  this  versatile  device*  The  work  le  exciting 
beceuee  the  protestors  under  development  appear  to  present  significant  competition  to 
alternatlvn  all-electronic  (e.g.,  VLSI)  Implements t lone  They  are  feat,  can  eparata  on  low 
power  budgets,  and.  In  certain  cases  at  least,  can  provide  the  high  digital  accuracy 
required  of  some  of  the  uore  demanding  algebraic  signal  precaaalog  applications* 

This  paper  reviews  end  assesses  the  basic  AO  elgebreie  proceaaor  architecture*  that 
have  been  proposed  since  the  Ineeptloc  of  these  new  developments  In  Ittl.  Humorous 
individuals  at  different  inat itotlome  have  contributed  to  tho  research  effort;  A  partial 
Hat  of  references  la  included;  more  extensive  references  will  he  Included  In  a  subsequent 
paper  based  on  thle  one* 

Section  2  begins  with  e  dlecuaeion  of  the  basic  properties  of  Bragg  cells  that  have 
bean  exploited  In  then#  processors*  That  la  followed  by  brief  comments  on  the  wide  range  of 
higher-level  elgebreie  processing  operations  that  can  be  yerfornad  by  matrix-vector  and 
matrix-matrix  multipliers  when  combined  with  appropriate  date  handling  circuitry*  The 
remainder  of  the  paper  dlecueeee  specific  architectures,  which  use  both  tingle-  and  mu)tl- 
tranaducar  AO  cells,  for  analog  and  digital  accuracy  matrix-vector  end  mat r lx-oat r lx 
multiplication • 


2*  Important  Operating  Characteristics  of  Acousto-Optic  Celia 

Two  major  characteristics  of  Bragg  calls  have  been  exploited  In  proposed  AO  elgebreie 
processors:  the  cspablllty  of  modulating  the  intensity  of  a  beam  of  light  and  tha 
capability  of  deflecting  e  beam  of  light  in  different  directions. 

Vlth  am  AO  modulator,  illustrated  in  Fig*  1(e),  the  intensity  of  the  diffracted  beam  la 
glveo  by  the  intensity  of  the  incident  beam  times  the  diffraction  efficiency  of  the  acoustic 
grating  traversing  the  call*  Electronic  circuits  can  be  designed  such  that  this  diffraction 
efficiency  la  proportional  to  tha  amplitude  of  the  signal  that  le  Input  to  the  cell  driver* 

An  AO  beam  de&lector.  Illustrated  in  Fig*  1(b),  operates  an  the  princtple  that  the 
angle  of  diffraction  of  the  output  been  is  proportional  (In  tha  amall-angla  regime)  to  the 
temporal  requency  of  the  acoustic  wavs  to  the  call.  One  can  thus  choose  the  direction 
taken  by  *:he  diffracted  beam  by  choosing  the  frequency  of  the  driving  signal*  Depending  on 
cell  type,  the  frequency  la  typically  In  eha  20  HRs  -  2  CRs  range.  As  suggested  la  Fig* 
1(b),  if  M  nlnueotdnl  signals  of  different  frequencies  ere  almulteneouely  Input  to  the  cell, 
H  diffracted  bssns  result,  each  propagating  in  its  own  direction*  By  changing  the 
amplitudes  of  the  different  sinusoids,  one  can  control  the  emouet  of  light *eeet  la  the 
different  directions* 

Ac ous co-opt le  elgebreie  processors  operate  with  oultlple  Input  end  output  beans,  as 
shown  la  Fig*  2*  Eeeh  Input  beam  le  modulated,  deflected,  or  both,  by  grating  sagmants  In 
d&ffareat  regtona  of  tha  Bragg  cell*  The  lnteneltlea  of  the  input  beene,  produced  by  LED's 
or  laser  diodes,  generally  very  with  time*  Figure  2(a)  shows  a  mult t-chanaal  bias 
modulator*  In  this  cast,  tha  Bragg  call  la  imaged  onto  an  array  of  detectcrt  such  that  each 
nodulated  beam  illemiaatee  a  different  detector*  The  Imaging  ayatan  la  of  the  Schllvrea 
type:  e  step  la  the  back  fecal  plana  blocks  all  Light  not  dlffractad  by  tha  acoustic 


|utU|  ri|(ti  2(b)  nhowe  •  )•>•  deflector ,  aktu  light  (rn  t 

given  input  Nu  c«u  b«  ml  elnultaneoualp  to  007  enkrat  of  tha  deteetore  la  tho  kack  focal 
plaar  of  tho  Iona.  farther,  tho  deflected  hoaao  caa  aach  ha  caatrolled  la  latenattp.  Tha 
acoueta-optlc  davlcea  caa  ba  thought  of  aa  providing  a  uatghted  aoltchlag  aatuork  that 
caoaaeta  X  laputa  to  K  outputa  oitk  oariakla  aolghta.  ta  tha  koaa  aodalater  coat,  X  equale 
■  a  at  tho  coaaaetloaa  arc  ooo-to-ono.  la  tha  baa*  deflector  caaa,  H  c.*a  differ  frcn  f,  and 
tha  coaaactleaa  arc  oae-to-naay.  la  both  caaae,  procaaalag  archltaeturaa  oaplott  tha 
plpollao  novaaent  of  aceuatlc  gratiag  aagaaata  ftoa  aaa  boaa  poaltloa  to  tha  aaat. 

It  la  lapertaat  that  tka  fua4aaaatal  llattatfoaa  of  40  coll  operation  bo  ua4oratoo4  la 
ardor  to  doteralao  tna  llaltatloaa  of  tho  calla  la  apacttlc  algahrale  pracoaalag 
coaf tguratteaa.  Coaelder  tho  baaa  aodalater  caaa  flret.  Kith  rofaraaca  ta  **.g.  1,  lot  oach 
Input  boaa  illuataata  a  aogaoat  of  acouatlc  aaoa  "algaal"  of  tooporol  duration  |T  (equal  to 
tha  uldth  of  tha  baaa  41*14a4  bp  tha  acouatle  wave  aalocltp).  la  order  for  tha  lncaaeltp  of 
aach  baaa  to  ba  aodulatad  ladapaadaatlp.  It  la  aacaaaarp  that  AT  aattafp  tha  condition 


AT  >  1/1, 

where  B  ta  the  taaporal  frequency  bandwidth  of  tha  call.  If  tho  acouatle  uawo  traaalt  tlao 
for  tha  entire  call  la  1  aacoada,  than  tho  aamlaua  auakor  of  laputa  and  outputa,  »,  la 
reatrlctod  bp 

■  -  T/AT  <  Tl, 

where  TB  la  tha  t loo-bandwidth  product  of  tho  call.  Typically  thta  auahor  llae  batwoaa  200 
aod  2000,  la  practlca,  tha  auahar  of  latarcoaaaeta  will  prohahlp  bo  algatf lcantlp  lowor 
than  thac--p<rhape  batwaaa  50  and  200. 

fondaooaeal  llaltatloaa  oa  tha  baaa  daf lector  approach  a -a  aatabllehed  with  raforcnca 
to  Tig.  4.  In  Pig.  4(a)  It  la  aaauaad  that  thara  la  one  lapel  baaa.  (Tho  coaf lguratloa 
ahown  la  eaaeatlally  that  of  aa  A0  apectrua  aaalpaar.)  Light  froa  thle  boaa  can  ha 
directed,  la  parallel  and  with  Individually  controlled  weighting,  to  anp  eoablnatloa  of  M 
outputa.  whara  X  <  TB,  tha  tlae-bandwidth  product  of  tho  call.  If  H  eacaada  TB,  tho  annual 
of  light  cent  to  aach  detector  cannot  ba  controlled  ladepondintly  (croaa  talk  raunlta).  In 
»lg.  4(b),  thara  are  two  Input  haaaa.  Bacauaa  oalp  half  tha  call  la  need  for  a  given  Input 
baaa,  the  nuahar  of  raaolvad  output  dataetora  aunt  ha  reduced  by  a  factor  of  two.  la 
general ,  aa  ahown  in  Pig.  4(c).  H  ieputa  caa  ba  coupled  to  no  aoro  than  TB/H  outputa  If  tho 
connection  wetghte  are  to  ba  Independent. 

Charactarlatlce  and  fundaaeatal  llaltatloaa  for  tho  two  aodaa  of  operation  are 
euaaarited  la  Table  1.  In  tha  table  both  tha  aaxiaua  auabar  of  polat-to-polat  conocctloaa 
and  the  aaxiaua  auabar  of  laputa  and  outputa  have  baaa  entered.  Two  olgatflcant  dlfftrancee 
atand  out.  Plrat,  aa  noted  before,  tha  baaa  dafloctor  node  allowe  for  ona-to-aauy 
lutercouaacte-- aaaantlallp  global  la  nature— 'vhereee  tha  baaa  aedulator  node  allova  only  for 
oec-to-ona,  nr  local,  latarconaacta.  On  tha  other  hand,  the  auabar  of  laputa  and  ovtputr 
both  cqucl  TB  for  tha  bean  aodulator  caaa,  wharoaa  for  tho  baaa  defloctcr  caaa  tha  product 
of  the  nuabrr  nf  lnputa  with  tha  auabar  o 2  outputa  ta  Halted  to  TB. 


Table  I.  Suaaarp  of  Charactarlatlca 


Interconnect  ona-to-ona  oaa-to-aany 

Typo  (local)  (Global) 


Muaber  of 

Poaatbla  <  TB  <  TB 

Point-to-point 

Coaaaetloaa 


Nuaber  of  I  laputa  M  laputa 

laputa  aad  N  Outputa  M  Outputa 

Outputa  ■  <  TB  KB  <  TB 


A  further  practically  laportant  difference  batveaea  tha  two  aodaa  of  operation  ahould 
bo  noted.  In  tha  bean  aoduletlon  node  of  operation  relatively  high  diffraction  efflctaactoa 
can  bo  achieved  without  aonllnearltlee  (which  aio  lnhoront  In  tha  acouato-opt le  diffraction 


^•(•(•1  cnplttitlil  Uttar*  tea  auch.  lanfii,  la  th*  kata  deflector  node  at  operation. 

It  1*  quit*  difficult  to  caattol  ItlltllalHl  tka  Intaaaltlaa  of  tk*  lldtrrtt  diffracted 
koaa*  If  high  diffraction  efficiency  la  daalrad:  aaallnaar  coupling  *a4  karaoole  eeapoaost* 
t*  tk*  grating  traaaalt taaca  distribution  latr*4aea  taa  aack  creaatalk.  Thu*,  la  *lt*al 
procaaatag  applteaclan* ,  aalt l-f ragueacy  kaaa  4aflact*r  4**lc*a  aaat  h*  aa*4  at  lav 
diffraction  af f Iclaac taa ,  attk  a  aubaaguaat  vaata  af  lapat  light. 

S.  Iaaartaat  Alaahralc  Ptaeatalaa  Oaeratlaa* 

A  aaabar  af  altakralc,  net rtx-ar tented  apatatlaaa  ar*  lapartaat  to  ao4ara  algaal 
processing  appllcattoaa  aack  a*  caatral,  pattata  racegaltlaa,  a4*ptl*a  kaaa  foreleg, 
41r*cttoa  finding,  a*4  apactral  aaalyata.  Particularly  lapartaat  eparatloaa  lacla4a  aatrlx- 
tactor  aultlpllcatloa,  aatrlx-aatrlx  suit lpllca t lea ,  Graa  Schal4t  orthogonal last  Ion , 
aolutton  at  aata  of  llaaar  agaattaaa,  dataralnat lo*  af  eigenvector*  aa4  eigenvalues  at 
aatrlcaa,  alagular  value  decoapea 1 t tea  af  aatrlcaa,  and  laaat-aqaaraa  aatlaata*  af  aalatlaa* 
af  aata  ar  llaaar  aguatlaea.  Of  tkaaa,  tka  flrat  taa— aatrlt-aaetor  aa4  aatrla-aatrls 
aultlpllcatloa— arc  tk*  aaat  kaalc,  *a4,  la  fact,  they  aft**  fara  a*  Integral  part  af  th* 
other  operation*.  Thu*,  tkara  kaa  kaaa  coasldarabla  aaphaal*  ultkla  tka  optic*  caaaunlty  aa 
daaaloptag  accurate,  hlgkapaad,  aaraatlla  procaaaara  apaelflc  to  thoa*  tvo  taaka.  A 
aubaequant  (and  aft**  aoatrlrlal)  taak  la  t*  datarala*  how  tuck  procaaaara  caa  ka  k*tt 
conf tguiad  la  larger  ayataaa  ta  parfara  tk*  higher  order  algebraic  operation*  aotad.  I*  tk* 
following  aactloa*  v*  dlacaaa  a  warlaty  af  apaelflc  AO  praccaaar  archltacturaa  far 
perforatng  th*  two  kaalc  aatrlx-vector  aad  aatrlx-aatrlx  product  apatatlaaa. 

4.  Ilagla-Traaaducar  Archltacturaa 

U*  bagla  with  procaaaara  that  aa*  alagla-tranaducar  AO  call*,  caaaldarlag  flrat  a 
aatrix-ractor  aulttpllar.  Par  rafaraac*  wa  aata  that  th*  caapoaaata  af  a  utrlr-vactar 
product  hac*  th*  fara  aaggaatad  by  th*  lx)  axaapl*  af  Cg.  < 1 > t 

*1  *11  *11  *11  *1  ’l  "  *11*1  *  *12*2*  *11*1 

»2  *  *21  *22  *21  *2  *  rt*r*  *1  “  *21*1  ♦  *22*2*  *21*1  <l> 

»)I  l*)l  *12  *)l|[*l  *1  *  *11*1  *  *12*2*  *))*) 

The  flrat  laplaaaatatlaa  dcacrlkad  la  baaed  on  th*  kaaa  aedulater  aod*  at  aparatlaa. 
yigur*  1(a)  ahov*  a  ayataa  coaflgurad  for  th*  aul t ' plleatlen  of  a  2-coapaoaat  vector  by  a 
2x2  aatrlx.  Till  conf Iguratloa  wa*  awggaatad  by  Tatar*  (l|  la  raapaaa*  ta  aa  earlier  acheae 
prepaaad  by  Caulfield  aad  node*  (21.  Th*  processor  caaalat*  af  aa  Input  laaar  dlod*  (LO) 
array,  a  colllnatlan  laaa  far  each  aourca,  aa  acouato-aptlc  call,  a  Schliaraa  laaglng 
ayataa,  atd  a  llaaar  array  af  lategratlag  detectors. 

Th*  flrat  Input  to  tka  acouato-aptlc  call,  vector  coaponaat  *.,  produce*  a  ab-rt 
diffraction  grating  with  diffraction  efficiency  propartlonnl  t*  x.'that  aovaa  acre**  the 
call.  Hhaa  that  grating  aagaaat  ta  la  franc  af  10  fl,  a*  nhown  14  fig.  1(h),  th*  laaar 
died*  1*  pulaad  with  light  energy  proportional  ta  aatrlx  coefficient  a,.,  aad  integrating 
detector  *1  I*  llluateatad  with  light  energy  la  propartlaa  t*  tka  product  a.,*,.  Th*  naxt 
critical  aaaoat  occur*  whoa  th*  a,  grating  aagaaat  la  la  fraat  af  10  #2  aad1*  dacead  grating 
••gaunt,  with  diffraction  efficiency  la  proportion  ta  vector  caapaaant  x, ,  haa  aovad  in 
front  of  LO  *1,  aa  above  ta  fig.  5(c).  At  that  aoaaat  LO  fl  la  pulaad  with  light  aaargy  la 
propartlaa  ta  a.,  and  LO  #2  la  pulaad  with  light  aaargy  la  prepattle*  to  a,,.  The 
Integrated  output  af  detector  fl  la  nuw  proportional  ta  a, .a,  ♦  which  la  tha  output 

vector  caapaaant  y..  Tha  Integrated  output  af  detector  II  li  a, .if  it  thla  ataga.  Tha 
final  critical  aaedat  in  tha  cauputatiea,  shown  In  Pig.  5(d),  oleufa  after  gracing  aagaaat 
x,  haa  aovad  In  fraat  af  LD  f2.  A  final  pair*  fro*  that  laaar  dloda,  la  proportion  ta  *„, 
yield*  at  th*  output  of  detector  f2  a  voltage  la  proportion  to  ♦  *22*2'  tt>a  ,,c“^ 

coapoaent  y j  af  the  aatpat  vector.  The  ceaputattea  la  aav  coapldta. 

Tha  avaluatloa  of  a  aatrlx-vactar  product  by  thla  pracaaaar  takaa  T  sac  (th*  AO  call 
tlaa  wladaw)  bafora  thn  flrat  y,  ceaaa  out.  (T  la  thua  t'.a  latency  af  th*  pracaaaar.)  It 
takaa  another  T  nee  ta  coaplate1the  entlra  aatrlx-vactar  product,  yielding  a  total  af  2T 
procaaa  tin*.  ,Tha  aaxlauu  auabar  of  aparatlon*  (ault tply/adda )  that  caa  ka  parforaad  1* 
that  da*  la  ■  ,  whara  »  <  IT.  Thua,  tha  theoretical  Halt  aa  arocaaotag  rata  la  glvo*  by 

Precasting  tata  <  »2T/2  aparattnaa/aac. 


a  10  uaac  tlaa 


Operatiea  of  the  preceeeor  la  uillr  titu4(i  to  wltti-utrti  aultlpltcatloa  if  it  la 
Hltl  that  a  net rla-aetr lx  praliet  eaa  ba  evaluated  aa  a  auecaaaloa  at  aacrla-vector 
prodecta.  Tkaa,  net rlx-natrlx  product  £  «  *1,  |l«ai  la  tha  It)  eaaa  by 

*11  *11  *11  bll  b12  b13  *11  *12  *13 

*11  *21  *23  b21  b22  b23  “  *21  *22  *23  (2) 

*31  *31  *33,  b31  b32  b3S  *31  *32  *33 


At  A,  A,»  A,  »  -  t  £,  «,  £S  J. 


■  b21  ■  *2  ■  b22 


for  large  aatrlcaa  tha  latency  caa  ha  Ignored,  aad  tha  proceoelng  rata  la  eeeeatlally 
<g  T/2)  ope  rat  lone  par  aecood. 

It  akoald  ba  atraaaad  that  thla  pracoaaovaad  tha  athar  pracaaaara  dlacuaaad  la  thla 
paper— aparataa  with  light  lataaaltlaa.  which  ata  alwayo  ooonegative.  Than,  If  hlpalar  at 
coapleu-valued  vactora  aad  aatrlcaa  ara  to  ba  aultlplled,  aultlplaalag  er  cadlag  achaaaa 
acat  ba  eaplnyed.  A  variety  of  aathode  hava  baaa  propaeed;  all  raault  la  aaaa  reduction  la 

ayataa  throughput  (typically  a  factor  of  two  or  three)  aad  aa  laeraaaa  la  ayataa  coaplealty. 

figure  i  llluatratea  a  ayataa,  prepoaed  by  Caaaraat  at  al.  (3),  that  aaplolte  both 
acewcto-optlc  aodulatlon  aad  baaa  deflection  for  perforalag  aatrlx-vactor  aad  aatrlc-aatrla 
aultlpllcatloo.  Initially  all  laaer  diode  aeurcea  are  off  while  thegragg  call  la  loaded 
with  a  aequance  of  coapoalte  grating  eegnaate,  each  of  which  caa  diffract  light  froa  a  given 
Input  bean  to  any  coablnetlon  of  output  dutactora  with  arbitrary  weighting.  Khan  tha 
coapoalte  gratlnga  are  In  tha  correct  poaltloae,  tha  laaer  dlodee  ara  atrohad  on  with 
Intonaltlea  proportional  to  U|,  ij,  etc.,  aa  ahowa. 

To  work  with  a  ceacreta  eaaapie,  aaauaa  that  output  vector  coapooent  y  la  given  by  y, 

•  3a,  ♦  da,  ♦  2a,.  When  the  aourcea  are  atrobad,  heaa  1,  with  lntenelty  proportional  to  a{, 

haa  part  it  lta  energy  diffracted  to  detector  1  with  diffraction  efficiency  3k,  k  being  eeao 
proportionality  coaetaat.  Slaultaaeouely ,  baaa  2  la  diffracted  to  tha  aaaa  detector  with 
diffraction  efficiency  Ik  aad  baaa  4  with  diffraction  efflciaacy  3k.  Tha  raault  la  an 
output  at  detector  1  proportional  to  k(3a.  ♦  da,  *  2a,),  t.a.,  proportloaal  to  y,.  It  the 
aaae  tlae  thla  la  happening,  light  la  aloe  being  diffracted  la  propat  aaounta  to'tha  other 
detectora  to  calculate  yJt  yJt  etc. 

Thu  aaouat  of  tlae  It  takaa  for  a  alngla  uacrla-»7tctor  product  to  ba  evaluated  la 
deteralned  alaoat  entirely  by  tha  fill  tlae  T  for  tha  AO  call  (tha  flaah  tlae  being 
negligible  by  coaparleon).  During  that  tlaa,  R2  analog  aultlply/adde  are  perforaed,  where  R 
<  (IT)'  The  nuabet  of  operatloae  perforaed  per  aeeoad  thua  equale  tba  cell  bandwidth  >. 
Table  2  auauatltaa  the  dlffarencaa  betwaea  the  baaa  aodulator  aad  baaa  deflector  approachaa. 
The  nuabera  aaauae  a  Iragg  cell  bandwidth  g  of  100  KRi  and  a  tlaa  window  T  of  10  uaac. 


Becauaa  of  the  aaall  aatrla  dlaenalenallty  that  can  ba  accoaaodated  by  the  been 
deflector  ayeeea  J(fT)1  ,  aa  oppoaed  to  tha  dlaenalenallty  gT  achieved  by  tba  baaa 
aodulator  ayataa)  thla  ayataa  doea  not  look  attractive  for  alapla  aatrle-vector 
aultlpllcatloo.  Rowevar,  when  aatrta-aatrla  products  ara  eonaldarad,  parforaaace 
tli#  two  ii9t oichii  1*1  lOMvhit  elei«r a 


Ratrla-natrla  aultlpllcatloo  uelog  thla  baale  architecture  la  llluatratad  la  Tig.  T. 
The  approach  la  phllaaophleally  tha  eaaa  aa  wee  dlacuaaad  baforei  aatrla  C,  given  by 
aatrta-aatrla  product  Ag,  la  ealculetad  vector  by  vactor,  aa  (a  Iq.  (3).  At  tha  laataat 


depleted  ti  ft*.  7,  M«t«f  c.,  Mjmnui  kf  l  e.|  |*  (where  t  4«**t •«  traaapeae), 

1*  calculated  by  flaabiag  tab  l**«r  41o4«*  la  pit^ittiH  it  *..,  a  ad  b-, ,  at  ahovo. 

T/H  aacoada  litit  tfc*  grating  H|uati  hawa  aov«4  ay  to  cba  adit  Milttoi,  iitra  e.  •  [  *. 

c„  |‘  eaa  b*  evaluated,  aa4  to  forth,  far  tba  aatrtu-aatrla  praduct  evaluation  therd1 
ll‘a  1/2  aae  lataacy  (aaaunlng  f  la  atlll  tha  transit  tlaa  of  Cba  ectlre  Bragg  call),  T/2 
aac  additional  processing  Claa,  and  a  total  of  «J  uul t lply /adds  part oraad ,  ahara  H  < 
(!/*)(»?)  ,  for  aa  avarall  procoaalag  rata  of 


Froeaaalag  taco  »  (IT) 


A  typical  rata  with  (  •  100  MBs,  T  "  10  oaac  1*  dnIO  opa/aac.  Macrf n-B*crln  aultlpllcatloo 
charactarlatlca  for  cba  too  baalc  procaaaor  archltacturaa  ara  alao  euuwarlsed  la  Tabla  2. 

It  ahoold  ba  aaphaaltad  that  thaaa  llaltatloaa  ara  theoretical,  aa4  that  la  practice  other 
considerations  aay  ba  overriding.  Aa  noted  earlier,  AO  dofloctloa  of  Incident  boaaa  late 
aultlplo  dlacrato  dlractlooa  la  lapoaalblo  at  blgb  diffraction  offtclaacy  (onlaaa  tba 
Inherent  nonlinear  affecta  are  eonehov  pracoapaaaatad) ,  and  aatrln  dlaenaloaallty  la  Halted 
to  perhape  10  or  11  for  aqnara  aatrlcoa.  On  tba  other  hand.  It  aay  ba  lapractlcal  to 
laplaaent  tha  beaa  aodolator  aethod  with  area  100  laaar  dlodaa,  let  aloaa  tha  fall 
coapluaant  BT.  Thun,  tha  thaoratlcal  adeaatagea  of  tha  haaa  aodolator  aethod  aay  aawer  ba 
rcallaad  In  practice. 


Matrix-Vector  Multiplication 


Coaparlaon  of  Mathoda 


BEAM  HOBQLATOt  METHOD 


Matrln-Matrln  Multiplication 


T  aac  latency  (before  flrat  y{  caaaa  out) 
T  aac  to  coaplota  confutation 
»  <  BT 


3«J0l  opa/aee  for  T  •  10  oaac,  B  •  100  MHa 


B*T/2  oparatlona/aoeoad 
SnI0t0  opa/aac 


BEAM  DEfLECTOE  METHOD 


T  aac  latency 

Ecaaotlallz  no  additional  proceeelng  tlaa 
H  <  (BT) 1 ' 1 

B  spare tloaa/aecoad 

10s  opa/aac  for  T  -  10  noac,  B  -  100  MEi 


(BT) ,  r BT  oparua/nac 
4*10  opa/aac 


)•  Multi -Transducer  Archltacturaa 

Thun  far  all  ayateaa  deacrlbed  have  uaad  AO  celle  with  alaglo  traenducern— l .a. ,  only  a 
alnglo  acouatlc  heaa  la  preaent  for  acouato-opt 1c  Interaction.  Soao  of  tha  aoat  recant 
devclopaeata  In  AO  etgaal  proceaalng  have  baea  baaed  an  aultl-traeedwcer  call  archlceeturee. 
High-quality  AO  celle  have  bean  fabricated  with  aa  aany  aa  100  treaaducara,  oach  producing 
tea  ova  laolatod  acouatlc  beaa,  and  call*  with  treaaducara  la  tha  10  to  10  range  can  now  ba 
fabricated  on  o  regular  baala. 

Aa  a  flrat  acaaplo  of  a  aultl-traaaducar  Bragg  call  architecture ,  wo  conatdar  aatrln- 
aatrln  aulctpllcetlon  uatag  AO  beaa  asdulatloa  aethod*.  figure  B(a)  abowa  a  ayataa 
coaatatlng  of  two  thrne-traoaducar  A0  call*,  laagad  with  Schllaran  optica  onto  ooa  another 
and,  aubeequently,  onto  a  la)  array  of  detector*.  Illualnatloa  la  apaelally  ualfora  aad 
pulaad  la  tlaa.  Bacaun*  of  tha  Schllaran  tanging  optica,  only  light  diffracted  by  botl  A0 
cell*  arrive*  to  the  detector  plane.  Thun,  If  raw  traanducor  1  aad  coluan  tranaducer  2  are 
tha  only  two  to  receive  algnala,  only  datactor  (1,2)  will  ba  llluatnatad. 

Tor  aatrtn-aatrl*  aultlplleatlan  tha  coaponentn  of  tha  Input  aatrtce*  are  nequenced 
Into  the  t  to  orthogonal  call*  a*  auggeatad  by  fig.  B(b).  The  coefficient*  a  ,  are  Input 
horlnontally ,  a  flrat,  tha*  a,,  and  a,.,  aad  ao  forth,  alaultanaoualy ,  the Jeoef f lc lent  a 
are  Input  td‘th*  vortical  edit  transducer*,  b, .  flrat,  than  b,.  and  b.,,  ate.  At  they 
aowo,  tba  grating  aegaeata  repreaaatlag  then*  nuabdrs  af f actlwalyICroaa  oat  another  la 
■paca,  eaualng  light  to  b*  diffracted  to  datactor*  la  corraapoadlng  apatlal  locatloa*.  The 
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riot  algnlf  lent  MMit  occure  when  grntlag  aegnenta  1..  a  at  a.,  at*  laa|al  oata  aach 
•that.  At  that  laataat.  tha  caaasa  aource  la  pulaed,  aha  light  daatgy  In  proportion  to  tha 
icalatt  e-.b..  la  aaat  to  Intagrntlag  dntnetor  (1,1).  A  abort  tlao  latar,  aftar  noveuent 
of  tha  itlllli  aegnenta  through  oaa  haaa  width,  light  lnteaalty  la  proportion  to  tha  product 
a  ,1,.  la  aaat  to  tho  aaaa  dotoctor,  and  aa  forth,  until  tha  aatlra  aatlra  aua  k(a,.b  .  a 
aJ?b;J  ♦  <k  eaaataat ) ,  proportional  to  «...  haa  haaa  latagratod.  Slallarll!  il 

oltat *tatoftratlag  dotactoro,  othar  partial  auaa  ate  batog  aaaluatad  to  calculata  output 
aatrla  coofflclanta  c . . ,  c,.,  ate.  Aa  hatora,  all  nuahara  auat  ba  aonuegatlve,  and 

coding  or  aultlplualn i  jooI *bo  afod  to  laploaaat  bipolar  or  coupler  arlthaatlc. 

A  oacoad  aaaapla  -  a  nult 1-traaaducar  archltactura  la  tho  optical  outer  product 
calculator,  Uluatratc-l  achenatlcally  la  fig.  9  (t,S).  In  thla  caaa,  tha  lndlaldual  aound 
coluana  la  tha  aultl-tranaducer  AO  call  as*  abort,  tarring  eaeeatlally  aa  point  aodulatora 
for  light  paaalag  through  than.  Light  froa  tha  cortical  laaar  diode  array  la  apread  out  and 
recollected  by  optica  not  ahowa  ao  aa  to  lllualnata  a  aquern  array  of  datectora  la  tho 
output  piano.  Tho  lntaaalty  at  aach  horlaoatal  row  la  controlled  by  a  glean  LD  aourca;  tha 
Intensity  ac  aach  colunn  of  tho  output  array  la  controlled  by  a  glean  AO  aound  ware,  by 
3uch  an  archltactura  It  ta  poaatbla  to  calculato  outer  prodneta,  l.a.,  nacrla-aat rtx 
producta  of  tha  typa 


*ll  *>1  *2  *3> 


*11  *11  *13 
*21  *22  *23 
*31  *32  *33 


Such  a  calculation  la  Integral  to  tha  calculation  of  coeartanca  natrlcea— »of  groat 
lnportanca  In  algnal  prpcaaalng— -and,  aa  auggoatad  by  Eq.  (A),  a  auccaaalon  of  outer 
product*  can  ba  unad  to  calculato  arbitrary  antrlr-aatrlr  product*  a*  wolli 


bU  bl2  b13| 


|cll  *12  *13 


*21  *22  *231  b2i  b22  b23|  “  *21  *22  e23 
j*31  *32  *33  (  b3t  b32  b33|  1*31  *32  *)l| 

*lll,bll  h12  ‘iS1 2 3  [*12]tb21  b22  b23J  [s13l  tk31  b32  b33 


-  *21 


An  acouato-optlc  doelca  1*  not  rnqulrod  for  thla  operation-— only  aona  hind  of  nultl- 
tranaduenr  linear  array  nodulator.  Bowaear,  AO  call*  appear  to  ba  attraction  candidate*  for 
auch  a  tank. 

4.  Digital  Accuracy  Matrix-Vector  Multiplication 

Tha  proceaaor*  daaerlbad  abae*  have  dynaalc  range  and  accuracy  datarnlnod  by  the 
eourcaa,  nodulator,  aod  datectora.  Output  accuracy  la  United  to  eight  to  ten  blta,  often 
Inadequate  for  demanding  algebraic  algnal  proceaalng  taaka.  Several  nethoda  for  performing 
algebraic  arlthnetlc  optically  with  digital  accuracy  have  boon  described  by  Cullfeylo  (4|, 
Athalo.  Collin*,  aod  Stllwell  {7],  and  by  Seeker,  Clayton,  and  Stanley  |S).  Son*  of  tho 
underlying  concnpta  aro  dlecuaaed  In  thla  aectlon. 


1.  Digital  nult lplleat Ion  can  be  perforaed  by  naan*  of  dlacreta  convolution  (aorlal 
product)  H). 

2.  Dlacrato  convolution  can  be  lnplanentnd  In  torn*  of  a  natrtn-vector  product  (10|. 

3.  fort  it lonlng ,  la  conblnatlon  with  dlacreta  convolution  via  natrtx-vector 

nult  lplleat  laa,  allow*  aatrlx-vector  (and,  for  that  natter,  aatrlx-aatrlx)  product*  to  bo 
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calculated  with  digital  accuracy. 

feint  I— digital  aultlpllcatloa  kg  dlacrata  convolution — la  denonoerated  b /  aa  exanple. 
Aaaane  tha  two  (declnal)  auabara  M  and  15  ara  ta  ka  nultlplled,  to  ektala  tha  raault  5*5. 
Although  any  haaa  eaa  bo  wood  for  tha  digital  repraeaetatlon,  wa  aoa  baaa-2.  Tha  baaa-2 
aaltlplleatloa  haa  tha  fora 

10  0  111  Of) 
till  (IS) 


1  0  0  l  1  1 
10  0  111 
10  0  111 


Tha  lntaraadlata  naabor,  111223121,  repreeeata  tha  flaal  raaalt  In  nlnod  binary  forat  aach 
digit  repreeente  tha  walght  af  2  ralaad  te  that  power;  tha  walghta,  howaaar,  ara  aot 
raatrletad  to  ka  0  or  1.  Tha  bettoa  linn  af  tha  calculation,  1001001001,  la  tha  ataodard 
binary  fora  for  tha  raaultaat  groduct  3*5.  Row  nota  that  tha  aland  binary  raault  can  ka 
obtained  by  conaolalng  tha  two  eaqueacaa  af  1'a  and  0'a  ragraaaatlng  tha  lnguta; 
agaclf leally, 

(10011  1)*(1  1  1  1)  •  (1  1  1  2  2  3  3  2  1).  (g) 

To  calculate  tha  dlacrata  convolution,  tha  two  aeqwencee  ara  antarad  In  tha  aggrogrlata 
locatlona  of  a  vector-aatrlx  groduct  calculation.  In  particular.  It  la  anally  ahowa  that 
tha  antrlx-vector  product 


raaulta  In  tha  anna  aaquence  of  auabara,  (e,  c j  c-  e.  c.)  (tha  eaapoaanta  of  tha  output 
aactor)  aa  doaa  tha  aarlal  product  1  5  * 

(•l  «j  ajlMbj  b2  *3*  *  *C1  *2  e3  e*  e3*  '  (*®> 

All  thta  la  parhapa  not  too  aurgrlaing  whan  It  la  aotad  that  tha  baalc  aparatlaaa  of  both 
natrlx-natrlx  nul t lgl lea t ton  and  dlacrata  canaoluttoa  ara  aultlpllcatlon  and  aunalng.  Tha 
operation  gartoraad  la  not  truly  digital.  In  that  tha  nlxad  binary  output  quantities  ara 
analog.  Howavar,  for  a  reaaonabla  nunbar  of  blta,  garhapa  up  to  32,  tha  dyaanlc  range  of 
tha  output  can  ba  wall  within  tha  capablllttae  af  tha  opto-elactronle  conponanta  being  uaad. 

If  a  natrix-vactor  (or  natrlx-natrlx)  groduct  la  to  ka  evaluated  ualng  tha  digital 
accuracy  by  convolution  technique.  It  la  nacaaaary  ta  locorgarata  tha  Individual  natrix- 
vactor  groducta  (tha  aarlal  product  calculat Iona)  into  a  larger  natrix-vactor  product  by 
partitioning.  Thle  la  llluetrated  la  Tig.  10,  where  tha  caaa  of  a  2x2  natrlx  aultlplylng  a 
2-coaponent  vector  la  canaldarad.  Each  caagoaant  la  aaaunad  to  ba  given  by  a  3-deelaal 
ragraaantat Ion  an  Input,  yielding  five  declaale  on  output.  Tha  eorreepondence  between 
locatlona  of  tha  natrlx  conponanta  and  tha  aub  natrtcee  repreeenttng  than  la  Indicated. 
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Figure  1.  Beam  modulation  (a)  and  beam  deflection  (b)  using  an  acousto-optic 
Bragg  cell. 


MULTI-CHANNEL  BEAM  MODULATOR  HULTl-CHAHHEL-BEAR  DEELECTQB 


(a)  (b) 

Figure  2.  Multi -channel  beam  modulation  and  deflection.  Schlleren  stop  in  (a) 
prevents  undiffracted  light  from  reaching  image  plane. 
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Figure  3.  Limitations  on  bean  modulator  case:  number  of  independent  addressing 
beams  limited  to  T/  T. 


Figure  4.  Beam  deflector  case:  M  Inputs  can  be  connected  to  no  more  than 
TB/M  outputs. 


Figure  5.  Beam  modulator  based  matrix-vector  multiplier:  (a)  general  system, 
(b)  first  crltlcol  moment  In  operation  ((c)  and  (d)  next  page). 
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Figure  6.  Beam  deflector-based  matrix-vector  multiplier.  Matrix  coefficients  0,1  are 
frequency  multiplexed  on  composite  grating  segments.  Loser  diode 
intensities  are  strobed  in  proportion  to  vector  components  x^. 
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Figure  7.  Modification  of  system  of  Fig.  6  for  matrix-matrix  multiplication,  The 
loser  diodes  ore  strobed  sequentially  with  different  column  vectors  of 
matrix  |. 


Multi-transducer 
acousto-optic  cell 


Integrating 
detector 


Figure  8.  Multi -transducer,  two-cell  system  for  matrix-matrix  product  calculation. 


goto 

Seajenclng 


to  i 
toto 
to  | 


cn  *  °nbn  +  °i2(^2i +  °i2^a 


Figure  9  (above).  System  for  calculating  the 
outer  product  of  two  vectors.  Not 
shown  ore  optics  for  spreading  and 
collecting  light  rays. 

Figure  10  (right).  Example  of  partitioning 
for  digital  matrix-vector  multi¬ 
plication,  Eoch  component  of 
original  2x2  matrix  Is  expanded 
Into  a  3x5  matrix,  with  similar 
expansions  for  vector  components. 
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OPTICAL  COMPUTING: 

THE  COMING  REVOLUTION 
IN  OPTICAL  SIGNAL 
PROCESSING 


Development  is  progressing  toward  a  new  generation  of  optical 
computational  devices  th.it  may  provide  for  ultra-high-speed 
matrix  algebra  and  for  the  density  of  interconnections  needed  in 
optical  supercomputers. 

By  H.  John  Caulfield,  John  A.  Neff,  and 
William  T.  Rhodes 


MO  IASER  FOCUS/ELECTRO- OPTICS 


NOVEMBER  1983 


>  Optical  signal  processing  haa  ita  root*  in  the 

nineteenth  century  work  of  Lori  Rayleigh,  Huy¬ 
gens,  Abbe,  Lipmann,  and  others,  and  ita  greatest 
promise  in  the  twenty-first  century.  Here  in  the 
late  twentieth  century,  a  a.  nail  number  of  optical 
processing  systems  (spectrum  analyzers,  syn¬ 
thetic-aperture  radar  processors,  ambiguity  func¬ 
tion  generators,  etc.)  have  already  supplanted 
their  electronic  counterparts,  and  others  may 
succeed  soon  (pattern  recognition,  direction  find¬ 
ing.  etc.).  The  advantages  of  optics  over  electron- 
.  ics  in  these  systems  include  some  combination  of 

lower  cost,  reduced  size,  lower  power  consump¬ 
tion,  higher  speed,  and  potentially  enhanced  reli¬ 
ability. 

Although  it  is  not  yet  realistic  to  plan  for  a 
general-purpose  optical  computer,  it  ia  possible  to 
think  seriously  about  fairly  general  optical-array 
processors,  as  suggested  by  Pig.  1,  that  can  be 
used  as  adjuncts  to  digital  computers  for  perform¬ 
ing  specific  algebraic  computations  at  very  high 
*j«*eds.  Designs  are  currently  under  considera¬ 
tion  for  ultra-high-speed  optical  processors  to 
evaluate  polynomials,  matrix-vector  products, 
matrix-matrix  products,  and  solutions  of  sets  of 
linear  equations. 

This  article  reviews  the  developments  of  the 
last  several  decades  that  led  to  this  position, 
describes  briefly  some  important  areas  of  current 
research  and  development,  and  lists  several  areas 
of  expected  major  future  development. 

Philosophy  ond  recent  developments 
Operations  performed  by  optical  systems  are  de¬ 
scribed  by  simple  muthematics:  convolution, 
multiplication,  integration,  etc.  It  requires  only  a 
minor  change  in  outlook  to  convert  from  mathe¬ 
matics  as  a  description  to  mathematics  as  the 
goal  of  the  optics.  Such  a  viewpoint  was  taken  by 
Cutrona  at  the  University  of  Michigan  as  early 
as  1955  when  he  described  the  application  of 
optical  systems  to  the  evaluation  of  general  su¬ 
perposition  integrals  and  to  the  multiplication  of 
a  vector  by  a  matrix.  Indeed,  many  of  the  early 
researchers  of  optical  signal  processing  sys¬ 
tems— Gabor,  Leith,  Cutrona,  Kozma,  Vender 
Lugt,  Stroke,  Mertz,  Lohmann,  Rogers,  Good¬ 
man-recognized  the  potential  of  optical  systems 
for  performing  a  variety  of  mathematical  opera¬ 
tions.  These  researchers,  and  many  after  them, 
concentrated  primarily  on  continuous  analog  op¬ 
erations  such  as  integral  transformations,  and  in 
that  sense  their  contributions  relate  well  to  earli¬ 
er  or  concurrent  developments  in  analog  electron¬ 
ic  computing. 


During  the  past  several  years  attention  has 
turned  to  a  different  application  of  optics  to 
mathematical  operations,  in  this  case  operations 
that  are  numerical,  sometimes  discrete,  and  often 
algebraic  in  nature.  Indeed,  the  redirection  of 
attention  has  been  so  vigorous  that  many  view  it 
as  a  small  revolution  in  optics:  optical  signal 
processing  is  beginning  to  encompass  what  many 
feel  is  aptly  described  as  optical  computing, 
where  the  term  is  fully  intended  to  imply  close 
comparison  with  the  operations  performed  by 
scientific  digital  computers.  The  optical-array 
processor,  mentioned  earlier,  forms  the  basis  for 
this  revolution.  (The  term  optica/  computing  has 
been  used  occasionally  for  nearly  two  decades 
now  in  connection  with  analog  optical  processors, 
but  a  major  fraction  of  the  optical  signal-process¬ 
ing  community  has  never  felt  comfortable  with  it 
because  of  the  implied  comparison  with  general- 
purpose  digital  computers.  That  situation  is 
poised  for  change.) 

In  retrospect,  the  beginning  of  modern  optical- 
array  processors  was  the  invention  of  what  is  now 
often  called  the  Stanford  optical  matrix-vector 
multiplier  (OMVM).  This  device,  illustrated  in 
Fig.  2,  has  a  capability  of  multiplying  a  100- 
component  vector  by  a  100  x  100  matrix  in 
roughly  20  ns.  Components  of  the  input  vector  x 
are  input  via  a  linear  array  of  LEDs  or  laser 
diodes.  The  light  from  each  source  is  spread  out 
horizontally  by  cylindrical  lenses,  optical  fibers, 
or  planar  lightguides  to  illuminate  a  two-dimen¬ 
sional  (2-D)  mask  that  represents  the  matrix  A. 
Light  from  the  mask,  which  has  been  reduced  in 
intensity  by  local  variations  in  the  mask  trans¬ 
mittance  function,  is  collected  column  by  column 
and  directed  to  discrete  horizontally  arrayed  de¬ 
tectors.  The  outputs  from  these  detectors  repre¬ 
sent  the  components  of  output  vector  y,  where  y 
is  given  by  the  matrix-vector  product  y  *•  Ax: 
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Since  light  intensity,  which  is  always  nonnega¬ 
tive,  is  used  to  represent  the  various  mathemati¬ 
cal  quantities,  special  coding  techniques  must  be 
employed  if  both  positive  and  negative  (or  com¬ 
plex-valued)  numbers  nre  to  be  accommodated. 

As  originally  conceived,  the  Stanford  OMVM 
suffers  from  several  potentially  serious  limitations: 
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•  Accuracy  i«  limited  by  the  accuracy  with  which 
the  source  intensities  can  be  controlled  and  the 
output  intensities  read; 

e  Dynamic  range  is  source  and/or  detector  limit* 

ed; 

e  Rapid  updating  of  the  matrix  A  requires  the 
use  of  a  high-quality  2-D  read-write  trans¬ 
parency— a  spatial  light  modulator  (SLM) — 
whose  optical  transmittance  pattern  can  be 
changed  rapidly.  Unfortunately,  such  a  device 
does  not  yet  exist  with  all  the  desired  characteris¬ 
tics,  although  candidate  devices  are  being  im¬ 
proved  rapidly. 

Despite  these  drawbacks,  the  Stanford  develop¬ 
ment  brought  about  an  important  swing  within 
the  optical  signal-processing  community  from  a 
preoccupation  with  coherent,  Fourier-transform- 
based  processors  to  incoherent,  geometrical  op¬ 
tics-based  processors.  It  is  interesting  to  note  that 
this  change  in  direction  was  initiated  by  Prof. 
Joseph  W.  Goodman,  whose  book  on  Fourier 
optics  had  enshrined  coherent  optics  so  firmly  in 
many  minds. 

The  speed  of  the  OMVM  (a  result  of  the  optical 
parallelism  in  the  system)  presented  researchers 
with  a  perplexing  problem:  the  processor  could 
operate  at  speeds  far  exceeding  the  ability  to 
input  and  output  data,  which  often  required 
digitization  for  compatibility  with  surrounding 
electronic  systems.  One  approach  to  circumvent¬ 
ing  this  problem  is  to  use  the  OMVM  for  iterative 
algorithms,  where  the  processor  output  is  direct¬ 
ed  in  analog  form  back  to  the  input.  A  variety  of 
iterative  processing  uses  of  the  device  were  devel¬ 
oped  by  Casasent,  Caulfield,  Goodman,  and 
Rhodes.  One  example  is  the  implicit  inversion  of 
matrix  equation  y  =  Ax  (i.e.,  solution  for  vector 
x,  given  vector  y  and  matrix  A)  by  the  iterative 
algorithm  (or  its  continuous-time  counterpart) 

•  x, . »  -  (I  -  A)  x,  +  y, 

where  I  is  the  identity  matrix. 

The  next  major  development  came  about 
through  pressure,  gently  applied,  from  a  small 
number  of  researchers  who  were  equally  at  heme 
in  both  electrcnic  and  optical  computing,  particu¬ 
larly  Harper  Whitehouse  and  Jeffrey  Speiser  at 
the  Naval  Ocean  Systems  Center  and  P.  Denzil 
Stilwell  at  the  Naval  Research  Laboratory. 
Through  their  persistence  they  convinced  the 
optical-processing  community  of  three  important 
things.  First,  algebra  without  high  accuracy  is 
not  very  useful  (see  box:  "The  Need  for  High 
Accuracy").  Second,  optics  can  achieve  high  accu¬ 


racy  in  the  same  way  electronics  docs — by  going 
digital.  This  led  to  the  first  suggestion  by  Psaltis 
of  a  means  to  achieve  digital  optics.  Third,  the 
newly  emerging  field  of  systolic-array  processing 
should  be  amenable  to  optical  implementation. 
This  latter  suggestion  led  to  work,  primarily  by 
Caulfield  and  Rhodes,  on  an  optical  systolic-array 
processor,  described  below.  Soon,  both  published 
and  unpublished  work  by  Tamura,  Casasent,  and 
others  advanced  this  area  greatly. 

Systolic-array  processing,  developed  principal¬ 
ly  by  H.  T.  Kung  at  Camegie-Mellon  University 
and  S.  Y.  Kung  at  the  University  of  Southern 
California,  is  an  algorithmic  and  architectural 
approach  to  overcoming  limitations  of  VLSI  elec¬ 
tronics  in  implementing  high-speed  signal-pro¬ 
cessing  operations.  Systolic  processors  are  char¬ 
acterized  by  regular  arrays  of  identical  (or  nearly 
identical)  processing  cells  (facilitating  design  and 


FIOURI  1.  A not  processor  (AP)  shown  hosted  by  o 
general  purpose  control  processing  unit  (CPU)  Two-way 
comrrmn  cations  are  via  a  data  bus 
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noun  2.  ihe  Stanford  motrin-vector  multiplier.  Not 
shown  in  the  tigure  ore  bght-spreoding  and  collecting 
optics. 
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fabrication),  primarily  local  interconnection*  be* 
tween  cell*  (reducing  signal-propagation  delay 
times),  and  regular  data  flows  (eliminating  syn¬ 
chronization  problems). 

Although  the  motivating  factors  are  different, 
systolic-processing  algorithmic  and  architectural 
concepts  are  also  applicable  to  optical  implemen¬ 
tation.  This  is  primarily  because  of  the  regular 
data-flow  characteristics  of  optical  devices  like 
acousto-optic  cells  and  CCD  detector  arrays,  and 
because  of  the  ease  of  implementing  regular 
interconnect  patterns  optically. 

An  example  of  an  optical  systolic  matrix-vector 
multiplier  is  shown  in  Fig.  3.  The  processor 
consists  of  an  input  LED  or  laser  diode  array, 
ccllimation  lenses  for  each  LED,  an  acousto-optic 
cell,  a  Schlieren  imaging  system,  and  a  linear 
array  of  integrating  detectors.  The  pedagogical 
example  of  Fig.  3  is  set  up  for  the  multiplication 


of  a  2-component  vector  by  a  2  x  2  matrix. 

The  first  input  to  the  acousto-optic  cell,  vector 
component  xlt  produces  a  shott  diffraction  grat¬ 
ing,  with  diffraction  efficiency  proportional  to  x:, 
that  moves  across  the  cell.  When  that  grating 
segment  is  in  front  of  LED  1,  as  shown  in  Fig. 
3(b),  the  LED  is  pulsed  with  light  energy  propor¬ 
tional  to  matrix  coefficient  an,  and  Integrating 
Detector  1  is  illuminated  with  light  energy  in 
proportion  to  the  product  ej|Xj.  The  next  crit'cal 
moment  occurs  when  the  xt  grating  segment  is  in 
front  of  LED  2  and  a  second  grating  segment, 
with  diffraction  efficiency  in  proportion  to  vector 
component  x2.  has  moved  in  front  of  LED  1,  as 
shown  in  Fig.  3(c).  At  that  moment  LED  1  is 
pulsed  with  light  energy  in  proportion  to  o12  and 
LED  2,  with  light  energy  in  proportion  to  o2 ,.  The 
integrated  output  of  Detector  1  is  now  proportion¬ 
al  to  auX|  +  al2Xf,  which  is  the  output  vector 
component  y,;  the  integrated  output  of  Detector  2 
is  021*1-  The  final  critical  moment  in  the  compu¬ 
tation,  shown  in  Fig.  3(d),  occurs  after  grating 
segment  x2  has  moved  in  front  of  LED  2.  A  final 
pulse  from  that  LED  in  proportion  to  an  yields  at 
the  output  of  Detector  2  a  voltage  in  proportion  to 
a2,x,  +  022X2,  the  second  component  y2  of  the 
output  vector. 

Much  like  the  Stanford  OMVM,  the  systolic 
opticsl  processor  described  has  a  dynamic  range 
and  accuracy  determinea  by  the  sources,  modula¬ 
tor,  and  detectors.  Output  accuracy  is  limited  to 
eight  to  ten  bite.  A  realistic  processing  capability 
for  such  a  system  would  be  the  multiplication  of  a 
100- component  vector  by  a  100  x  100  matrix  in 
approximately  10  ps.  This  is  much  slower  than 
the  Stanford  processor  speed;  however,  unlike  the 
latter,  the  systolic  system  does  not  require  a  2-D 
SLM,  and  the  matrix  can  be  changed  with  each 
opera -ton. 

Shortly  after  the  development  of  the  optical 
systolic  matrix-vector  multiplier,  two  important 
advances  took  place — the  invention  of  optical 
matrix-matrix  multipliers’ (see  box:  'Matrix-Ma¬ 
trix  Multipliers*)  by  Dias;  by  Athale,  Stilwell, 
and  Collins;  by  Bocker,  Bromley,  and  Caulfield; 
and  by  Casasent — and  the  achievement  by  Gull- 
foyle;  by  Athale,  Collins,  and  Stilwell;  and  by 
Bocker,  of  digital  accuracy  with  optical  algebraic 
processors. 

One  method  for  obtaining  high  digital  accuracy 
using  optical  processors  is  to  implement  digital 
multiplication  by  convolution.  This  method  was 
first  brought  to  the  attention  of  the  optical  sig¬ 
nal-  processing  community  by  Speiser  end  White- 
house  and  first  implemented  by  Psaltis  et  al.  The 

wammmmmmmmmmmmmmmmmmmmmmM 

lASfR  FOCISTI E CTRO-OPTlCo  10* 


6-37 


OPTICAL  COMPUTING 


method  ie  explained  with  the  aid  of  Fig.  4,  where 
base-2  multiplication  of  the  decimal  numbers  39 
end  15  is  performed  to  obtain  the  decimal  result 
655.  In  Fig.  4(a)  the  multiplication  is  performed 
in  normal  fashion:  Lines  1  and  2  contain  the 
input  binary  numbers  to  be  multiplied,  Line  3 
contains  a  mixed-binary  representation  of  the 
output,  and  Line  4  contain:  the  output  in  full 
binary.  In  the  mixed-binary  representation,  each 
digit  represents  the  multiplier  of  a  power  of  2; 
however,  unlike  full  binary,  the  value  of  the  digit 
is  not  restricted  to  be  0  or  1.  One  means  for 
converting  from  mxed  binary  to  full  binary  is 
shown  in  Fig.  4(b):  each  digit  of  the  mixed-bina  y 
representation  (Line  3)  is  expressed  in  full  binary 
form,  and  these  binary  numbers,  appropriately 
shifted,  are  added  using  a  standard  base-2  adder. 
The  resultant  binary  number,  1001001001,  is  the 
decimal  product  555  expressed  in  base  2. 

Binary  multiplication  via  convolution  is  possi¬ 
ble  because  the  intermediate  mixed-binary  repre¬ 
sentation  can  be  calculated  by  discrete  convolu¬ 
tion  (or  aerial  product)  of  the  binary  input  se¬ 
quences.  This  is  illustrated  in  Fig.  4(c).  Convolu¬ 
tion  of  binary  sequences  is  easily  accomplished  by 
acousto-optic  convolvers.  Since  only  l’a  and  0's 
need  to  be  represented  by  the  acousto-optic  cells, 
cells  can  be  operated  at  peak  diffraction  efficiency 
without  concern  for  nonlinear  response.  Further¬ 
more,  the  output  detector  is  only  required  to  have 
sufficient  dynamic  range  to  distinguish  between 
a  small  number  of  light  levels.  For  five-bit  in¬ 
puts,  as  in  the  example  considered,  the  output 
levels  will  range,  when  quentized,  from  sero  to 
live.  In  general,  A'-bit  inputs  require  that  N 


notlM  4.  Digital  muftplieotior  by  convolution: 

(a)  Multiplication  of  binary  numeers;  (b)  Conversion 
from  mOea  10  tun  binary,  (cj  Convolution  ol  Input 
sequences  to  obtain  mixed  binary  Intermediate  result 


TMl  NEED  FOR  HIGH  ACCURACY 
Computers  calculate  by  the  same  elementary 
operations  (addition,  subtraction,  multiplica¬ 
tion,  division)  that  humans  use.  The  result  of 
each  calculation  has  associated  with  it  an  uncer¬ 
tainty  O'  error.  Depending  on  the  number,  or¬ 
der,  and  nature  of  the  required  calculations, 
these  errors  can  be  multiplied  greatly. 

This  is  why  32-  and  even  64-bit  accuracy 
computations  are  sometimes  done  even  when  a 
d-bit  answer  will  suffice.  This  is  also  why  ana¬ 
log  solutions  (electronic  or  optical)  to  algebraic 
problems  must  often  be  avoided. 

In  electronics,  analog  computers  are  used  for 
high-speed,  easily  implemented  operations,  but 
digital  computers  are  used  for  algebra.  Not 
surprisingly,  optical  computation  makes  the 
same  division  of  tasks. 


levels  be  distinguishable  at  the  output.  Negative 
numbers  can  be  handled  using  2’s  complement 
arithmetic  or  other  methods. 

Tne  above  method  for  digital  multiplicatiosi  by 
convolution  can  be  used  in  a  variety  of  ways  in 
algebraic  optical  processors  to  obtain  higher  ac¬ 
curacy,  albeit  at  the  cost  of  (ewer  processing 
rates.  A  digital-accuracy  matrix-vector  processor 
conceived  by  Guilfoyle  achieves  high  processing 
rates  by  using  multitransducer  acousto-optic 
cells.  Athale.  Collins,  and  Stilwell  have  imple¬ 
mented  digital-accuracy  outer-product  matrix- 
matrix  multipliers  using  a  tingle  pair  of  acoasbt- 
optic  cells. 

Current  research  and  now  directions 
Efforts  undertaken  during  the  next  few  years  will 
be  in  two  directions.  First,  optical  matrix  comput¬ 
er  systems  based  on  the  concepts  we  have  been 
describing  will  be  built,  tested,  unproved,  and 
applied  to  new  areas.  Second,  new  types  of  non¬ 
matrix  optica!  computers  will  be  developed.  We 
will  touch  on  both  of  these  directions  briefly. 

In  optical  matrix  computers  the  two  thrusts  are 
implementation  and  extension.  To  date,  very 
little  implementation  has  taken  place.  Doing  this 
will  require  both  time  and  money;  it  now  appears 
that  these  will  be  provided.  Practical  issues  of 
component  selection,  electronics,  and  system  in¬ 
tegration  must  and  will  be  faced.  However,  the 
design  of  practical  optical  matrix  processors  still 
needs  new  ideas.  Complex  operations  must  be 
planned,  e.g.,  Kalman  filtering.  Kalman  filtering 
is  a  means  for  obtaining  the  statistically  best 
estimate  of  the  current  and  future  state  of  a 
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process  governed  by  a  known  differential  equa¬ 
tion  and  measured  in  a  fixed  way  with  known 
measurement  statistics.  Because  a  single  'cycle* 
of  a  Kalman  filtering  operation  involves  many 
matrix  calculations,  real-time  Kalman  filtering 
must  be  restricted  to  relatively  small  problems. 
Performing  the  matrix  operations  (triple  multipli¬ 
cations,  inversions,  etc.)  optically  may  permit  the 
handling  of  large  problems  in  real  time.  Casasent 
has  started  this  effort,  and  several  others  are 
working  on  it.  Either  floating-point  operations  or 
on-the-fly  scale  adjustment  is  needed.  Caulfield 
has  shown  that  both  are  possible,  but  his  solu¬ 
tions  are  probably  more  existence  proofs  than 
final  answers.  New  algorithms  are  needed  to 
extend  the  range  of  applications  and,  possibly,  to 
speed  up  calculations.  To  date,  all  important 
algorithms  have  been  iterative.  Noniterative, 
fully  parallel  solution  of  linear  equations  is  possi¬ 
ble  in  analog  optical  processors.  Can  similar 
things  be  done  for  digital  optical  processor*? 

Nonmatrix  optical  processors  are  developing 
independently  and  rapidly.  Perhaps  the  most 
widely  pursued  of  these  is  the  use  of  optics  to 
make  arbitrary  interconnections  among  electron¬ 
ic  (Goodman)  or  electro-optic  (Lohmann,  Lee, 
Collins,  Goodman,  Saw-chuck,  Strand,  etc.)  sys¬ 
tems  .  Sawchuck,  Strand,  ar.d  their  coworkers 
have  implemented  a  variety  of  space-variant  and 
space-invariant  interconnect  patterns  using  com¬ 
puter  holograms  to  generate  the  patterns  and 
spatial  light  modulators  to  feed  the  information 
back  into  the  system.  Their  system  (like  those 
due  to  Lohmann,  Lee,  Collins,  etc.)  closes  on  itself 
for  feedback.  Clesrly,  however,  this  is  not  the 
only  configuration.  Feedforward  configurations 
lead  to  a  variety  of  optical  artificial-intelligence 
systems. 

The  continuing  demand  for  higher  throughput 
rates  will  drive  future  research  toward  higher 
speeds  and  greater  parallelism.  In  these  large 
systems,  or  supercomputers,  of  the  future,  a  ma¬ 
jor  problem  in  achieving  high  throughput  rates 
will  be  how  to  facilitate  generalized  communica¬ 
tions  among  the  large  number  of  processing 
units.  In  a  general-purpose  computer,  the  full 
advantage  of  parallelism  will  only  be  realized  if 
each  processing  unit  has  direct  communication 
with  every  other  unit,  thus  permitting  each  to 
handle  a  part  of  the  action  on  a  continuing  basis. 

The  highest  level  of  communications,  or  inter¬ 
connect  as  it  it  called,  entails  a  generalized 
crossbar  network  involving  N2  interconnects 
available  for  N  processors  (N  units  communicat¬ 
ing  with  N  units),  at  shown  in  Fig.  5.  Such  a 
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FIOURI  8.  Generated  crossbar  network. 
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network  become*  very  expensive  when  imple¬ 
mented  electronically  for  large  N,  but  the  inher¬ 
ent  parallelism  of  optics  holds  great  potential  for 
inexpensive  and  high-speed  crossbar  switching. 

The  generalised  crossbar  can  be  expressed  ana¬ 
lytically  in  terms  of  a  vector-matrix  multiplica¬ 
tion,  so  optical  algebra  forms  the  basis  of  solving 
the  interconnect  problem.  For  example,  consider 
the  Stanford  OMVM  described  previously.  Let  x* 
and  y*  be  the  vectors  of  the  crossbar  input*  and 
outputs,  respectively,  and  let  A  represent  the 
interconnect  switch  settings.  That  ia,  ■  1  if, 
and  only  if,  the  ith/output  is  connected  to  the  yth/ 
input.  Otherwise,  ov  -  0.  The  OMVM  with  these 
o¥’$  automatically  makes  the  desired  connections 
optically.  Note,  too,  that  numerical  accuracy  is 
not  an  issue  for  this  application. 

The  Stanford  processor  is,  of  course,  nonpro¬ 
grammable;  therefore,  it  can  only  be  used  in  a 
system  with  a  pre-established  set  of  intercon¬ 
nects.  If  one  were  to  replace  the  matrix  filter  with 
a  real-time  device  such  as  a  2-D  spatial  light 
modulator,  then  a  switchable,  generalized  cross¬ 
bar  becomes  a  possibility;  likewise,  the  binary 
matrix  mask  could  be  replaced  with  a  hologram. 
Going  one  step  further,  one  begins  to  envision 
generalized  crossbars  with  picosecond  switching 
speeds  via  real-time  four-wave  mixing  or  an 
optically  addressed  bistable  array.  Such  a  caps- 
bility  would  bring  us  into  a  realm  of  computer 
communications  beyond  the  wildest  dreams  of 
electronic  interconnection  architects. 

A  more  structured  optical  arrangement  ia  the 


NOttRI  *.  Architecture  for  performing  iterative  pro- 
ceiling  with  the  RU8tC  Cube,  using  feeooock. 


fiberoptic  lattice  filter  (Tur,  Goodman,  etc.). 
When  the  computational  problem  haa  sufficient 
symmetry,  a  full  matrix  approach  may  be  an 
inelegant  and  expensive  approach.  The  lattice 
filter  work  represents  an  exploration  of  simpler 
systems  for  simpler  problems.  A  very  common 
problem  in  algebra  is  the  evaluation  of  polynomi¬ 
als.  If  an  analog  optical  polynomial  evaluator 
could  be  built,  it  would  be  possible  to  find  the 
roots  of  polynomials  in  a  totally  new  way:  scan 
the  independent  variable^)  and  see  where  Use 
roots  occur.  This  leads  to  a  solution  of  another 
long-standing  optical  problem  as  well.  The  quo¬ 
tient  1/a  is  simply  the  root  of  the  function 
(1/x)  “  a,  which  can  be  evaluated  efficiently  in 
polynomial  form.  Work  along  this  line  is  being 
carried  out  (Verber,  Caulfield,  Ludntan,  Stilwell, 
etc.).  Since  holographic  memory  technology  al¬ 
lows  ready  content-addressable  access  to  vast 
amounts  of  data,  a  truth-table  lookup  processor 
appears  both  feasible  and  appealing.  This  ap¬ 
proach  is  now  being  studied  closely  (Gaylord, 
etc.). 

Finally,  all  of  these  optical  computers  are  in 
need  of  improved  or  specialized  components.  A 
major  DARPA-sponsored  effort  to  improve  spa¬ 
tial  light  modulators  is  just  beginning.  This 
seems  likely  to  lead  to  improved  throughput  rates 
by  providing  a  2-D  medium  capable  of  1000  x 
1000  individually  addressable  modulator  ele¬ 
ments,  a  cycle  rate  (READ/WRITE  time  cycle)  of 
1  kHz,  a  dynamic  range  of  30  dB,  and  less  than 
991  spatial  nonuniformity.  Other  needs  include 
source  and  detector  arrays  that  are  compatible  in 
resolution,  intensity,  and  dynamic  range  with 
these  spatial  light  modulators  and  that  possess 
individually  addressable  elements. 

Conclusions  and  outlook 
Upon  considering  the  broad  area  of  optical  alge¬ 
bra,  including  parallel  algorithms,  architectures, 
devices,  and  their  associated  materials,  a  large 
spectrum  of  interesting  and  important  research 
areas  comes  to  "light.”  As  the  national  interest  in 
the  computational  sciences  begins  to  shift  toward 
the  supercomputers  envisioned  for  the  1990s,  it 
will  be  vitally  important  for  the  optics  communi¬ 
ty  to  pursue  those  research  areas  for  which  optic* 
holds  the  greatest  appeal,  such  a*  large-scale 
matrix-matrix  or  matrix-tensor  operations  and 
processor  inter-  and  intracommunications.  We 
must  also  allow  ourselves  to  look  past  the  re¬ 
search  discussed  above  and  into  the  use  of  optics 
to  perform  real-time  circuit  reconfiguration.  For 
example,  light  could  be  uted  to  modify  the  index 
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of  refraction  within  waveguide*  in  each  a  manner 
aa  to  change  channel  layout*  and  beam -control 
element*  on  a  circuit  module,  thereby  adding 
much-needed  flexibility  to  optical  comp. 1  ing. 
These  new  directions  are  mentioned  to  con  .  y  to 
the  reader  something  of  the  excitement  of  n  ield 
that  is  not  only  maturing,  but  also  expanding. 
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The  Ikin/Miial  ci-culjr  loop  ami  the  coastal  array  of  lours  box  a  material  halt  space  art  studied  as 
antennas  lor  dircctne  transmission  into  the  hal'  space  >  ■  practical  situation  the  loop"  might  be 
located  in  air  taith  the  direevse  transmission  into  the  eerin  In  determining  the  optimum  geometry  lor 
the  single  loop  and  the  anas,  the  far-rone  Acid  patterns  >  .  1  directivities  of  these  antennas  when  placed 
over  lossless  dielectrics  are  lontidcftd  first  The  directive  pi  ope  rues  lor  the  lossless  dielectric  are  found 
to  be  indicative  of  those  fur  the  tame  antenna  over  a  medium  with  low  loss  when  proper  account  is 
taken  of  the  eaponential  attenuation  experienced  in  the  lossy  medium  Parametric  studies  are  used  to 
Obtain  the  maximum  dir-ti.l.es  tor  these  snicnnss.  For  the  single  loop  of  resonant  sire,  die  optimum 
height  user  the  interface  i'  determined,  and  for  the  iwo-elemcnt  array  consisting  of  a  driven  loop  of 
resonant  sire  with  a  stm-ie  parasite,  the  optimum  siec  and  spacing  of  the  parasitic  reflector  arc  found 
Measured  electric  field  raim-ftt  and  gains  of  model  antennas  above  an  interface  between  air  and  fresh 
water  are  in  good  agreement  with  the  theoretical  results 


INTRODUCTION 

In  an  earlier  paper  [  An  and  Smit'i  1982]  a  compre¬ 
hensive  theoretical  analysis  with  experimental  confir¬ 
mation  was  presented  for  the  d'cular-loop  antenna 
near  a  planar  interface  separating  two  semi-infinite 
material  regions.  The  numerical  results  presented  in 
that  work  showed  that  a  loop  in  free  space  over  a 
material  half  space,  such  as  ihe  earth,  could  base  a 
directive  field  pattern  into  the  half  space  when  the 
loop  is  close  to  the  interface  and  near  resonant  sire 
(the  circumference  of  the  loop  is  approximately  one 
wavelength  in  free  space).  In  this  paper  the  analysis  is 
extended  to  treat  a  coaxial  array  of  circular  loops 
above  the  interface,  and  numerical  results  are  pre¬ 
sented  that  demonstrate  the  optimization  of  the 
single  loop  and  the  two-element  array  (driven  loop 
with  a  parasitic  reflector)  for  maximum  directivity 
into  the  half  space. 

COAXIAL  ARRAY  OF  CIRCULAR-LOOP  ANTFNSAS 

The  coaxial  array  of  n  circular-loop  antennas 

|ju  I.  2.  3 . n)  over  a  planar  irtcilavr  is  shown  m 

Figure  I.  Lach  of  (he  per'ectly  conducting  loops  is 


driven  at  the  angular  position  d“Oby  ad*1"' 
lion  generator  of  voltage  IJ,..  The  radius  it  t.  .  n 
conductor,  the  radius  of  the  loop,  and  the  of 

the  loop  above  the  interface  are  denotes  ny  .  b,. 
and  A,,  respectively.  The  spacing  between  i  and 

For  a  harmonic  time  dependence  expi  u)  the 
electrical  constitutive  parameters  for  the  h  f  space, 
region  I.  containing  the  loops  are  the  e.Tsctive  per¬ 
mittivity  c„  and  the  effective  conduct!'  ;y  the 
parameters  for  the  other  half  space,  rep  on  2.  are  c,, 
and  o,j .  Both  materials  are  assumed  to  be  nonmag¬ 
netic.  it,  =  ii t  **  jr«.  The  complex  wave  number  in 
either  medium  is 

L, »  /I,  —  /x,  -  tuiiii, ty  1  a,  £  0  (It 

where  4,  »  c„  -  jo„/w.  and  the  wave  impedance  is 

C.  »  Oh, 

The  current  in  each  of  ihe  driven  loops  cm  be 
expressed  as  a  Fourier  cosine  series: 

f,(4>\  »  £  Mm)/,,  cos  (h 


1  V»  ii  Bell  Telephone  l  ihoriiorm 
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Hm)  *  2  «#0 

The  coefficients  /„  are  determined  hy  requiring  the 
tangential  component  of  the  electric  field.  F.t .  to  sat- 
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(33c)  of  An  and  Smith  [1982]  with  p  m  fc,,  r  «  —  h„ 
h  =  b,,  and  h  «*  b/ 

G^m)  »  -  j"  (»., 

-  ii.  (9«) 

Cml (m» »  £  I*,  >7|I[«V/|/*|IJ*J',  A,. 

+  b  h,/.‘RJi.  k„  <t>.  (9M 


\  \  /  '  *  i-1,2  (IOi 

^  j  and  the  relief'1  a  coefficients  R,  and  Rm  »re 

\.*»l-(*».7. -7jl(*ii7, +7jl  (Hoi 

f  ig  I  r<H»Ml  array  of  circular-loop  animnu  near  a  planar  X  ia,  4,»  «  (y,  -  7jM7i  *  7jl  (ll/>) 

inierfjcr 

with  A,,  «  kj/fc,  a  |;4,j.  After  inserting  (9a)  and 
isfy  the  boundary  condition  at  the  surfaces  of  the  (9h)  into  (8)  and  some  rearrangement, 
perfectly  conducting  loops:  »,  ,i„ 

r“'“l  {"T5 

Ml  *  1 '  1 

♦  —  b,t>,rjih,UJ>h,%.RJi,  k„  k,l  + 

r»  1.2.3.  -.n  7|  ’ 

.  f* C2| 

The  three  field  components  in  (4)  are  the  primary 

field  of  the  isolated  loop,  which  is  the  field  of  the  ith  where  h„„  =  min  (fc,.  hy).  Note  that  r^y  =  cm/l. 
loop  when  it  is  in  an  infinite  medium  with  the  When  (5).  (6).  and  (7)  are  inserted  in  (4)  and  the 
properties  of  region  I :  della  function  expanded  as  a  Fourier  cosine  series,  a 

..  .  set  of  linear  equations  results  for  the  coefficients /M: 

1.^.  --  -  J]  him\tmam  cos  (n^)  |5|  —  .  . 

2  ■  K,  (a«  +  bmMm  +  V  r -  f„.  113) 

the  secondary  held,  which  is  due  to  the  interaction  of  '* ) 

the  rth  loop  with  the  half  space,  region  2:  jm,  ^  j  ...  m 

-  —  £  HmV„  (>_  cos  (liidl  (6)  or  in  matrix  notation. 

2b,  m .  p 

and  the  field  due  to  the  current  in  the  jth  loop,  which  HJIAJ  t  •] 

also  includes  the  interaction  of  the  ylh  loop  with  the  vthere  the  elements  in  the  symmetric  n  x  n  admit- 

half  space:  tance  matrix  [  F..J  are 


£„>  -  — —  £  him |/.y  cos  I 

*4>,  a.O 


“/xCi(o«  +  '’-I  '  -/ 

F-J-K  |f-u  1 


Formulas  for  the  coefficients  a„,  and  h,(  in  (S)  and 

(6)  are  given  by  King  and  Smith  [1981]  and  An  and  The  mth  Fourier  series  coefficients  for  the  currents 
Smith  [1982],  respectively.  The  coefficients  r—)  are  on  all  the  loops  are  determined  by  solving  the  system 

of  linear  equations  (14a)  for  the  column  vector  [/_} 
‘-ii  “  cr/l"l  +  G„i<">  (»)  For  numerical  evaluation  a  finite  number  of  terms 

where  G,,,  and  G-y  are  given  by  equations  (32 c)  and  are  used  in  the  Fourier  series  (typically  20  terms),  so 
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(hit  m  m  0,  I.  2.  •  •  • .  .  Thus  there  ire  sys- 

tems  of  equations  (14a)  that  must  be  solved  to  com¬ 
pletely  specify  the  currents. 

DESCRIPTION  OF  DIRECTIVE  PROPERT'CS 

With  the  Fourier  scries  coefficients  for  the  currents 
in  the  transmitting  loops  determined,  the  total  elec¬ 
tron  .agnetic  field  in  either  region,  I  or  2.  is  deter¬ 
mined  by  summing  the  fields  produced  by  the  indi¬ 
vidual  loops.  Formulas  for  the  field  of  a  single  loop 
are  given  by  An  and  Smith  [1982.  equations  (36)  and 
($$)].  Note,  in  Figure  I.  that  the  spherical  coordi¬ 
nates  (r.  0.  <pt  and  the  components  of  the  electric 
field  f., .  E, .  and  Et  are  equivalent  to  (r\  Of.  $')  and 
Er.  E9 .  and  E,  of  An  and  Smith  [19X2] 

The  aforementioned  formulas  for  the  electro¬ 
magnetic  field  are  complicated  integral  expressions, 
similar  to  Sommerfeld  integrals,  that  apply  at  any 
field  point  (r.  0.  oi  Some  simplification  of  these 
expressions  is  desirable  when  they  are  used  for  a 
parametric  study  of  the  array  s  directive  properties. 

The  case  of  interest  is  the  one  where  the  antenna  is 
used  lor  directive  transmission  into  the  lower  half 
space,  region  2.  Here,  one  is  primarily  interested  in 
concentrating  the  electromagnetic  field  in  the  lower 
haif  space  at  polar  angles  near  and  minimizing 
the  electromagnetic  field,  or  power  radiated  and  div 
sipatrd.  in  the  upper  half  space.  Antennas  with  these 
properties  are  useful  in  communications  systems  with 
transmission  from  the  -rrfacc  to  below  ground  and 
in  detection  systems  where  signals  incident  from  the 
surface  arc  scattered  from  buned  objects.  Systems  of 
this  type  are  practical  mainly  when  the  dissipation  in 
the  material  half  space  (region  2|  is  low.  so  that  the 
electromagnetic  field  will  penetrate  to  a  significant 
depth,  i.e.,  the  exponential  attenuation  r'"’  for  a 
spherical  wave  originating  a'  the  surface,  point  S  in 
l-'igurc  .1,  must  not  be  excessive  at  the  radii  of  in¬ 
terest. 

The  directive  properties  of  an  antenna  in  an  infi¬ 
nite  lossless  dielectric  medium  are  easily  described  in 
terms  of  the  angular  dependence  of  the  far-zone  elec¬ 
tric  field  EV,  0.  0)  which  has  the  asymptotic  behav¬ 
ior  0(e*'*7r).  k  being  real.  For  the  antenna  over  the 
half  space,  the  behavior  of  the  field  is  complicated, 
particularly  when  there  is  significant  dissipation  in 
the  media.  At  angles  8  not  near  n/2,  the  asymptotic 
behavior  in  the  upper  half  space  may  be  ()(e  '*l'/r). 
and  in  the  lower  half  space  0{r  '*"/')  Near  the 
interface  (0  near  n/2)  a  "surface  wave"  may  domi¬ 
nate  with  the  asymptotic  behavior  0(  l/rJ). 


For  media  with  low  loss,  the  terms  of  (Xr~*'ir), 
i  -  1. 2.  in  the  asymptotic  expansion  for  the  field  can 
provide  a  useful  description  of  the  antenna's  direc¬ 
tive  properties,  particularly  when  the  field  at  angles 
near  0  -  it  and  not  near  8  «=  a/2  is  of  interest.  The 
terms  of  0{e~*'lr)  are  the  “geometrical  optics"  field 
[ Brekhoxskikh .  I98C;  U'ait.  1969]  and  will  be  re¬ 
ferred  to  as  the  far-zone  field  E'(r,  8.  <t>)  of  the  an¬ 
tenna  over  the  half  space.  For  the  array  of  circular 
loops,  the  far-zone  electric  field  in  region  I  is 


£>.  II.  4)  -  ^  — 


■  cos  l»id)[2..|U|6,  sm  0)  —  7,.,(* ,6.  sin  0)]i  (15a) 


L'Jir.  I).  <#>)  -  ctn  0  - 


t  {te*" 


>  _  R 


sin  0)} 


£  r»i/„.  sin  (nrdld.IL  ,6, 


£>.  8.  d)  -  0 

with  the  reflection  coefficients 

^  cos  0  -  it],  -  sin^ff)1-* 

*  cos  0  +  (*],  -  sin1  0*‘  ‘ 

*  ] ,  cos  0  —  (tj,  -  sin*  0)1 1 
*},  cos  0  +  (*],-  sin2  0|‘  ‘ 

The  far-zone  electric  field  in  region  2  is 


/.;«•.  0.  d>  -  'j  T,  — 


■  £*|6,  jr'*'**"*  £  Mm)Tcoi(m^) 

/»[d..  ,(*,6.  sin  0)  -  /..,(*,/>,  sin  0)]j 

F.'/r.  0.  d)  -  f,  ctn  0T,  — —  £ 

“  I- I  i 

'  £  /*m/„  sm  sin  0)1 

»•  i  ) 


0.  d>  -  0 
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*67 


with  ff  determined  from 

tin  (T  -  t,,  tin  0 

nr.d  the  transmission  coefficients 
_  2|cos0| 

1  “  |eot  0|  +  (*},  -  tin1  0)' 1 


Only  one  component  of  the  electric  field,  the  0  com- 
..  ponenl,  appears  in  (19),  because  the  field  is  linearly 
(,W)  polarized  at  0  -  a. 

A  special  case  of  interest  is  that  of  lossless  media 
(a^  —  0,  /  —  I,  2)  and  a  lossless  antenna.  For  this 
(16*)  case<  *  directivity  can  be  defined  for  the  array,  and  it 
is  equal  to  the  gain 


ft  “ 


2*„|co»0| 


*{,|cot  0|  +  (Jtjj  -  sin1  0)‘ 


(16/)  D(0-i.)-4«|fr/r.0-ir)|^Ca  EC.lFcl^  ’  (20) 


For  (1$)  and  (16)  to  be  useful  in  a  practical  situation, 
the  field  must  be  well  approximated  by  the  asymp¬ 
totic  form  (far-zone  fi.ld)  at  radii  r  that  are  not  so 
great  that  the  exponential  attenuation  (|e"'*'|  *» 
e  ")  has  reduced  the  field  to  an  impracticably  low 
value 

The  standard  measure  for  the  directive  properties 
of  an  antenna  in  an  infinite  lossless  dielectric 
medium,  such  as  free  space,  is  the  antenna  gain : 


4itr  V*  Re  (S;<r.  0.  0)1 
Go(0.  0) - ~ - 


(17) 


where  S;  is  the  complex  Poynting's  vector  in  the  far 
zone.  /•„  is  the  lime  average  power  supplied  to  the 
antenna,  and  Re  indicates  the  real  part.  The  gain  G„ 
is  independent  of  the  radial  distance  r. 

For  the  antenna  over  the  planar  interface,  both 
media,  regions  I  and  2.  may  be  dissipative,  in  this 
case,  if  the  definition  (17)  is  used  for  the  gain,  the 
gain  will  be  dependent  on  the  radial  distance  r.  since 
the  exponential  factor  r'u '  will  appear  in  the  nu¬ 
merator.  A  similar  problem  is  encountered  when  the 
gain  of  an  antenna  in  an  infinite  dissipative  medium 
is  considered  [Aing  and  Smith.  1981 ;  Moore ,  1963]. 
The  following  definition  is  proposed  for  the  gain  of 
the  antenna  over  the  planar  interface : 

Aw V“V-  Re  [S;<r,  0.  $)] 

G<0.  d )- - y- - -  (IS) 

where  (•  I  for  0  S  0  <  n/2  and  i  «  2  for  */2  < 
0  S  it.  The  inclusion  of  the  exponential  factor  in  the 
numerator  of  (IS)  makes  the  gain  O'  independent  of 
r. 

For  the  array  of  loops  the  gain  in  the  direction 
0  =  a  is 

G(0  -  «)  -  4*  Re  (C ,)|  ••**•'£>.  0  -  «)|* 

■(iCll1  i.G,U'0,|>)  ‘  (19) 

where  G,  is  the  input  conductance  of  the  Kh  loop. 


The  front-to-back  ratio  of  the  array  for  lossless 
media  is 


{,|£yr.6-«)|, 
0-0)  Is 


(21) 


NUMERICAL  RESULTS  AND  DISCUSSION 

The  case  of  a  single  driven  loop  (i  =  I)  with  a 
single  parasitic  rel'ector  0  =  2.  y0!  -  0)  will  be  used 
to  illustrate  the  directive  properties  of  the  loop  array 
over  the  half  space.  Even  with  only  two  loops,  sev¬ 
eral  parameters  are  still  needed  to  describe  the  an¬ 
tenna  and  media.  The  following  assumptions  are 
made  to  simplify  the  optimization  of  the  array: 
medium  1  is  assumed  to  be  free  space  or  air  (t„  = 
t0.  a,,  »  0).  the  radius  of  the  driven  loop  (loop  I)  is 
taken  to  be  fi0  h,  =  1.0  (a  loop  of  this  size  was  shown 
previously  to  have  useful  directive  properties),  and 
the  radii  of  the  conductors  of  both  loops  are  taken  to 
be  equal,  a,  =  a,  with  O,  =  2  In  (2n6,  /a,)  <*  20. 
The  directivity  of  the  single  driven  loop  is  optimized 
first  by  adjusting  its  height  A, ;  then  the  directivity  of 
the  array  is  optimized  by  adjusting  the  height  h2  and 
(he  radius  h2  of  the  parasite. 

Media  which  roughly  correspond  to  fresh  water 
and  moist  earth  are  considered  fo.  region  2,  these 
have  the  relative  effective  permittivities  i„2  =  c,j  'c* 
equal  to  80  and  10,  respectively.  The  effective  con¬ 
ductivities  o,i  of  the  media  are  assumed  to  be  zero  in 
the  optimization  (lossless  media,  loss  tangent  p,2  = 
o,2 /tot,,  «■  0).  Later,  the  directive  properties  of  the 
array  for  low  loss  in  region  2  art  shown  to  be  s.milar 
to  those  with  no  loss. 

Single  loop.  In  Figure  2  the  directivity  D(6  *»  rt) 
(equation  (20))  of  a  single  loop  (p0b  =  1.0.  D,  =  20) 
is  shown  as  a  function  of  the  height  h,  //.0  above  the 
interface:  the  relative  effective  permittivity  t„2  of  the 
lossless  half  space  is  the  parameter.  The  front-to- 
back  ratio  F  (equation  (21))  for  the  same  case  is 
shown  in  Figure  3. 
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Fig  l  Directivity  of  tingle  loop  in  air  liM  -  I.  p„  -  0)  over  lottlcts  dielectric  half  tpace  ptl  -  0)  it  a 
function  of  the  he  (hi  above  the  interface  h,/l,  with  the  relative  permittivity  im,  aa  a  parameter;  ■  10. 
O.  -  X 


The  directivity  is  seen  to  have  a  peak  when  the 
loop  is  close  to  the  interface.  A,/20  *  0.075.  The 
amplitude  and  sharpness  of  the  peak  increase  with 
increasing  permittivity;  the  maximum  directivity  for 
t„i  »  80  is  about  130  (21  dB).  and  for  t„,  «>  10  it  is 
about  21  (13  dB).  The  front-to-back  ratio  increases 
monotonically  as  the  loop  approaches  the  interface. 

<  0.2$  Far-zone  electric  field  patterns  for  the 


loop  at  the  optimum  height  (h,//.„  **  0.07$)  and  for 
media  with  c„2  =  80  and  10  are  shown  in  Figure  4. 
The  patterns  show  the  field  component  £,  in  the 
plane  £  «  0.  *  and  the  field  component  £,  in  the 
orthogonal  plat.e  #  -  «,'2.  in/2. 

The  peak  in  the  directivity  is  easily  understood  if 
the  isolated  loop  is  considered  to  emit  a  spectrum  of 
plane  waves;  some  of  these  are  propagating  waves 


Ftj.  J.  Fronf-lo-bact  ntio  of  tingle  loop  ia  th  (»„,  -  I.  p„  »  0)  over  lottlcu  dielectric  half  ipacr  . 
f.i  ■  Ot  «  ■  funriton  of  the  height  above  the  interface  k,/*^  with  the  relative  penmltmty  tmt  as  a  parameier; 
Mi  -  I  0.Q,  -  20. 
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Fig  4.  Magnitude  of  electric  fkk)  components  in  br  zone  of 
single  loop  in  air  -  I.  p,t  Of  o*cr  femicis  dielectric  half 
space  (r^t.  »•  0i;  0oh,  «*  1.0.  O,  =*  20.  »  0075.  (<j|  Air- 

water  interface  -  W)  (M  Air*earth  interface  tw]  -  10. 

with  respect  to  the  direction  -d:  others  are  evanes¬ 
cent  with  respect  to  this  direction.  The  propagating 
waves  in  the  spectrum  of  the  isolated  loop  appear  in 
the  far-zone  pattern  of  the  loop  oser  the  interface 
within  the  cone  described  by  the  critical  angle 


i.e..  in  the  medium  at  angles  a  -  0,„  sls«  The 
evanescent  waves  appear  outside  this  cone.  For  the 
medium  with  t„,  -  SO.  0,,,  —  6.4s,  and  for  the 
medium  with  e„ j  ”  10,  -  18.4’.  When  the  loop 

is  very  close  to  the  interface.  ht/X0  «  0.075,  the  eva¬ 
nescent  waves,  which  appear  in  the  field  pattern  at 
angles  */2  <  0  <  *  —  0,„,  broaden  the  pattern  (add 
side  lobes)  and  decrease  the  directivity  (for  example 
see  Figures  10  and  II  of  An  and  Smith  [1982]).  As 
the  loop  is  raised  above  the  interface,  h,//.0  in¬ 
creases;  the  evanescent  waves  are  exponentially  at¬ 
tenuated  and  become  less  significant  in  the  far-zone 
pattern;  the  pattern  narrows  to  one  with  a  width 
roughly  equal  to  20,, , ;  and  the  directivity  increases. 
The  back  lobe,  the  pattern  in  free  space  at  the  angles 
0  £  0  <  n/2,  increases  as  the  loop  is  raised  above  the 
interface.  The  increase  in  the  back  lobe  decreases  the 
directivity.  The*  two  competing  effects,  the  increase 
in  the  directivity  due  to  the  decrease  in  the  width  of 
the  main  beam  in  the  medium  (region  2)  and  the 
decrease  in  the  directivity  due  to  the  increase  in  the 
back  lobe  in  free  space  (region  I),  give  rise  to  the 
peak  in  the  directivity. 

tn  a  practical  application  the  media  would  not  be 
lossless,  and  the  field  at  a  finite  radius  r  would  be  of 
interest.  The  results  in  Figure  5  indicate  the  effects 
these  two  factors,  dissipation  in  region  2  and  a  finite 
radius,  have  on  the  field  patterns.  Here,  the  patterns 
are  for  the  thtec  components  of  the  electric  field.  £,, 


t"  i  >?„ '  <r<t,  i 


Fig  5  M i, ml ude  cif  normaltred  ekciric  field  components.  r^|E(.  on  i  sphere  of  radio*  do r  ■  10  for  single 
loop  in  air  (£„,  m  I  .  p.,  -  Olover  dtsstpamc  half  space  t'„,.p. ,).  d.h,  -  1 0.  f),  »  Xt  •  007J  lot  Ait  .water 
interface  -  80,  p,,  -  0.1.18)  Air-canh  interface  -  10,  p.,  -  0  I. 
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Fig  6.  Directivity  of  loop  with  single  parasite  in  air  |r„,  *  I, 
p, ,  -  Ol  over  lossless  dielectric  half  space  1 1„,  »  fcQ.  p,2  *  0»  as  a 
function  of  the  size  and  the  spacing  of  the  parasite.  p0b%  «  1  0. 
ht  /0  *■  0075. 0,  «  20.  <ij  «  u, 

and  F.t,  computed  on  a  sphere  of  radius  ft0r  = 
10.  Results  arc  shown  Tor  media  with  i„}  -  80  and 
Ht;  the  loss  tangent  lor  both  is  />,,  =0.1.  Note  that 
the  field  components  arc  multiplied  by  the  factor  e ". 
i=  I.  2,  to  show  the  detail  of  the  patterns  in  the 
region  with  dissipation,  region  2  f£„,  =  80.  r  ~“'  = 
0.01 1 ;  e„,  =  10,  e'  «=  0.21).  These  patterns  arc  for 
loops  at  the  same  height  as  in  Figure  4.  ft,  = 
0.075.  A  comparison  of  Figures  4  and  5  shows  that 
the  directise  properties  for  the  loop  over  a  lossless 


Fit,  7  Directivity  of  loop  with  tingle  pjtatne  in  tir  (/„,  »  I. 
p„  “  Oi  over  kmlevi  dielfttric  half  tp.ee  -  to,  p(I  -  0)  it  i 
function  of  ihe  ore  and  the  ipacing  of  the  pantile.  -  1.0, 
»,//«-  007S.D, 


Fig.  H  Magnitude  of  electric  held  components  in  far  tone  of 
loop  with  tingle  paratite  in  air  (|„,  -  I.  p„  -  Ol  over  lottlrvi 
dielectric  half  ipae*  hmi.  r,;  •  0).  p„A.  »  I  <X  *,  >„  -  0075. 
p„h.  v  I  02$.  d  >„  -  0 1.  Ik,  »  70.  a,  «  a,,  lul  Ait-waicr  interface 
t.j  -  HO  Ihl  Ait-earth  tntcitace  rwj  -  10. 


medium  predicted  using  far-zone  results  are  qualita¬ 
tively  indicative  of  those  for  the  loop  over  a  medium 
with  low  loss  at  a  finite  radius.  Note  that  the  pat¬ 
terns  in  Figure  5  have  structure  at  angles  near 
0  *  it, '2  not  predicted  by  the  far-zone  results. 

Loop  with  parasite.  Graphs  of  the  directivity 
D(0  «  it)  (equation  (20);  for  a  loop  with  a  single 
parasitic  reflector  are  shown  in  Figures  6  and  7; 
these  results  are  for  lossless  media.  p,3  ■»  0.  with 
t„i  =  80  and  10.  The  driven  loop.  /J0fi,  ■=  1.0.  is  at 
the  height  A,//.„  *=  0.075  that  was  previously  deter¬ 
mined  to  be  optimum,  and  the  radius  fi0h}  and  the 
spacing  rf *du//.0  of  the  parasite  are  varied  to 
obtain  contours  of  constant  directivity.  The  opti¬ 
mum  directivity  for  e„2  “  8°  is  about  25  dB  and 
occurs  when  fl0bj  a  1.025  and  d//„  %  0.3;  the  opti¬ 
mum  directivity  for  z„,  -  10  is  about  16  dB  and 
occurs  when  p0  h,  %  1.025  and  <///<,»  0.2-0.3.  A 
comparison  with  the  maximum  directivities  for  the 
single  loop  in  Figure  2  shows  that  the  addition  of  the 
parasitic  reflector  increases  the  directivity  by  about 
3-4  dB. 

Far-zone  electric  field  oatterns  for  the  optimum 
configurations.  fl0h}  »  1.025,  d//.0  =  0.3,  are  pre¬ 
sented  in  Figure  8;  a  comparison  with  the  patterns 
for  a  single  loop  in  Figure  4  shows  that  the  parasite 
has  reduced  the  hack  lobe  in  free  space  (air)  signifi- 
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Fif  9  Magnitude  of  normaliaed  electric  Arid  components.  e<’|E|.  on  a  sphere  of  radius  0,,r  -  10  fur  loops  in 
air  tt„(.  a,i  -  Oi  osar  dissipative  half  space  u„, .  p#;l.  flubt  »  1.0.  A,  -  0075.  fieh,  •  1  035.  <f  a0  -  0.5. 0,-20. 
a,  *  ala.  lot  Air-aalcr  interface  la;  **  ho,  ptl  -  0  1  fhl  Air-eanh  interface  tml  -  10.  p.,  -  0  I. 


cantly  and  narrowed  the  lobe  in  the  medium  slightly. 
The  elTecls  that  dissipation  in  the  medium  (/>,.  - 
0.1)  and  a  finite  radius  of  observation  tfl„r  *»  |0) 
have  on  the  patterns  are  illustrated  in  Figure  9.  As 
for  the  case  of  a  single  loop,  the  directive  properties 
for  the  array  over  a  lossless  medium  predicted  using 
far-zonc  results  are  qualitatively  indicative  of  those 


for  the  array  over  a  medium  with  low  loss  at  a  finite 
radius. 

The  directive  properties  of  the  single  loop  and  the 
two-element  array  of  loops  in  free  space  (air)  over  a 
lossless  material  half  space  arc  summarized  in  Figure 
10.  where  the  directivity  IH 0  »  a)  is  plotted  against 
the  relative  effective  permittivity  of  the  half  space. 


Fig.  to.  Directivities  tin  decibels!  of  loop  antennas  in  air  (r„,  -  t,  pr,  -  0!  over  lossless  dieted nc  half  space 
(*-i*  P.i  -  0)  as  a  function  of  the  relative  permittivity  of  the  half  space  r„, .  p#p,  -  1.0,  A,  U%  -  0075,  0,-  20, 

Ml- IWJ.d/A,-0y.*1««,. 
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Tip.  I)  Measured  relative  fain  for  single  loop  as  a  function  of 
the  height  abuse  the  air*»  atcr  interface.  fi0bt  -  I  01.  Ot  *•  13  5 


attached  at  one  side.  The  half-loop  antennas  are 
mounted  on  the  image  plane  and  fed  from  behind  the 
plane.  A  small  monopolc  probe  protrudes  through 
the  image  plane  and  is  free  to  move  through  360  on 
a  circle  of  radius  r  =»  30  cm.  The  probe  is  used  to 
measure  the  field  component  £,«()  in  the  aii  and  in 
the  water.  At  the  measurement  frequency  of  900 
MHz  and  room  temperature,  the  electrical  properties 
of  the  fresh  water  are  approximately  r.„3  a  78.8. 
p, 2  a  5.3  x  10 '3.  In  the  air,  fl,r  5.65,  and  in 

the  water.  fl2r  a  50  6.  i,r  a  1.34. 

Figure  12  is  a  comparison  of  theoretical  and  exper¬ 
imental  results.  |E#|.  for  a  single  loop  and  a  loop 
with  a  parasitic  reflector.  The  experimental  data  are 
normalized  to  the  theoretical  resu'ts  at  one  point  in 
each  medium  (0  =  0  in  air.  0  =  180  in  water).  The 
dimensions  of  the  antennas  are  close  to  those  pre¬ 
viously  determined  to  be  optimum.  fi„b,  =  1.01, 
h,//.0  =*  0075.  /?0b,  «»  1 05.  d  /.„  =  0.225.  The  agree¬ 
ment  between  theory  and  experiment  is  seen  to  be 
excellent.  Note  that  the  theoretical  patterns  in  Figure 
12  are  not  the  far-zone  patterns,  but  they  are  the 
patterns  computed  for  the  measurement  radius  using 
the  full  theory  for  the  loop  array. 

The  pattern  in  Figure  12  for  the  single  loop  has  a 
small  lobe  in  the  water;  this  is  due  to  the  exponential 
attenuation  experienced  by  the  held  in  the  water. 

0.26.  The  multiplicative  factor  r"  was  not 
included  when  plotting  these  patterns  as  it  was  in 


Figures  5  and  9.  The  addition  of  the  parasitic  reflec¬ 
tor  is  seen  to  reduce  greatly  the  level  of  the  back  lobe 
in  the  air  in  relation  to  the  main  lobe  in  the  water. 

The  relative  gain  Gift  =  it)  of  a  single  loop  in  air 
above  fresh  water  was  measured  as  a  function  of  the 
height  of- the  loop  above  the  interface  «,//,.  The 
results  from  this  experiment,  normalized  to  1.0  at  the 
maximum,  are  presented  in  Figure  13.  They  show  a 
peak  in  the  gain  when  the  loop  is  close  to  the  inter¬ 
face.  as  do  the  theoretical  results  in  Figure  2  for  the 
directivity  of  the  loop  over  a  lossless  medium.  The 
discrepancies  between  the  graphs  in  Figures  2  and  13 
are  probably  the  result  of  the  experimental  gain 
being  measured  with  a  field  probe  at  a  finite  radius 
from  the  antenna. 

CONCLUSIONS 

The  theoretical  analysis  for  the  horizontal  circular- 
loop  antenna  over  a  planar  interface  has  bcco  ex¬ 
tended  to  treat  a  coaxial  array  of  loops  over  the 
interface.  Parametric  studies  were  performed  to  de¬ 
termine  the  optimum  directivity  for  transmission  into 
a  lossless  half  space  both  for  a  single  driven  loop  and 
for  a  driven  loop  with  a  single  parasitic  reflector. 
These  results  should  prove  useful  in  the  design  of 
antennas  for  directive  transmission  from  the  air  into 
earth  with  low  loss. 

The  theoretical  analysis  was  verified  by  making 
measurements  of  the  field  pattern  and  the  gain  of 
loops  in  air  above  fresh  water. 

The  directive  properties  of  loops  with  other  shapes, 
i.e.,  not  circular,  are  expected  to  be  similar  to  those  of 
the  circular  loop  when  the  loops  are  near  resonant 
size  (the  circumference  divided  by  the  wavelength  in 
air  is  approximately  equal  to  1 1.  A  schematic  drawing 
showing  a  possible  simple  construction  for  a  two- 
element  array  of  loops  above  the  earth  is  shown  in 
Figure  14.  The  polygonal  loops  (octagonal  in  the 
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Fig  14.  Schcmsuc  drawing  showing  possible  conuriKTKm  lor 
iwiKlemni  array  of  loop*  above  the  earth. 
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figure)  are  formed  by  stretching  wire  around  equally 
spaced  insulating  poles.  The  driven  loop  is  fed  in  a 
manner  to  support  the  resonant  component  of  the 
current  in  the  loop.  For  a  circular  loop  this  would  be 
the  Fo»  rier  senes  component  21,  cos 
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Electric  Field  Probes — A  Review 
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min,  mp«U(Wm  vhmMm  Um.  and  imw  N>ki  N«  keen 
faaplcmcwled  fa  *  v  acieey  od  Tkrae  Nlliliwl  dlpolee  in  fmnMj 

uti  la  M  t -field  peoke  M  peovlde  •  ntpwue  wkkk  b  nearly  laoeraple 
far  all  fbululMi  at  ike  IkSm  IfeM.  DbN  deeeetne*  klbbnnf 
•US  electrically  tfcnrt  ar  realdlvliy  IfN  dipole*  ta  peodac*  »«rj  fcroad- 
kaad  fntai  111  MHa  la  2*  CHal.  TVrawcaapla  fmaan  art  nf  M 
provide  Irat  ilme-everaged  Bala  far  high  peak-power  aafilaM  IWMi. 
Optical  A  km,  lagrikrr  abk  a  aaiaakty  aidilaif  l<M  am.  may  ka 
■ted  la  fana  a  wide- band  aaapmaikiag  Saia  Mak  fi.w  ikr  dipole  aaS 
del  kin  la  a  rraalr  rwmdwml  Appllreimi  af  f-fklB  peek**  range  Ima 
I  ha  ataartaab  af  IWMi  la  Utlaf  animals  aipaarB  M  non  mailing  ran  la- 
llaa  la  Ikr  aaaunani  a#  ArlBa  la  air  far  electromagnetic  comparability 
ar  raBiailaa  ulHi  pwpam,  Probes  ara  evjllakle  ikat  caa  araiar  IWM 
rrreagtk*  fraaa  Ini  ikaa  I  V/m  la  a<if  IBM  V/m  (naat. 


INTRODUCTION 

FOR  MANY  YEARS  electric  fields  have  been  meaaiied  in  air 
and  in  material  media  using  electric  field  probes.  The  letm 
"f -field  probe"  will  be  used  lo  describe  a  variety  of  measurement 
tools  with  the  following  basic  characteristics:  a  dipole  antenna 
with  a  detector  mounted  across  the  gap  which  separates  the  two 
arms  of  the  dipole,  a  nonperturbing  data  link  connecting  the 
detector  output  w  ith  a  icmote  observation  site ,  and  the  ability 
lo  measure  accurately  field  strengths  from  about  I  V/m  to  1000 
V/m  (rms|.  Three  mutually  perpendicular,  lingle-anlenna.f-fitld 
probes  may  be  combined  in  a  closely  spaced  array  lo  construct 
a  probe  with  an  isotropic  response. 

Early  versions  of  the  £-field  probe  were  usually  "homemade” 
one-of-a-kind  devices  used  lo  measuie  relative  field  distributions, 
such  as  Che  field  in  the  aperture  of  a  microwave  anlenru  ( I } . 
When  concern  atose  over  the  possible  health  hazards  of  non¬ 
ionizing  electromagnetic  radiation  and  government  safety  stand¬ 
ards  for  human  exposure  were  developed,  a  need  was  created  for 
probes  that  could  make  an  accuiate,  absolute  measurement  of 
electric  fields  with  a  wide  range  of  parameiers.  such  as  the  level, 
frequency,  and  polarization.  A  new  gencrauon  of  f-field  probes 
was  developed  by  government  laboratories  and  commercial  firms 
to  meet  this  need.  This  paper  presents  lire  basic  theory  for  these 
probes,  describes  several  practical  design  that  have  been  imple¬ 
mented.  renews  the  mayor  applications  of  the  probes,  and  dis¬ 
cusses  the  state  of  the  art  of  these  devices,  including  develop¬ 
ments  which  are  likely  to  occur  in  the  near  future. 
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For  a  comprehensive  list  of  rtferanett  on  tue  design  and  uic  of 
electric  field  probe*,  the  reader  n  referred  lo  the  extensive  bib¬ 
liography  of  reference  1 10)  - 

II.  BASIC  PRINCIPLES  OF  OPERATION 

Several  electric  field  probet  have  been  developed  with  a  con¬ 
struction  similar  to  that  shown  schematically  in  Fig.  I.  There 
prober  contain  five  basic  elements:  a  dipole  antenna,  a  nonlinear 
detector,  optional  lumped  element  duping  and  filtenng  net¬ 
works.  a  nonperturbing  transmission  line  and  monitoring  instru¬ 
mentation  |2)  -(10) .  The  operation  uf  the  probe  is  fairly  simple. 
For  a  continuous-wave  incident  field  with  the  frequency  u.  the 
antenna  produces  an  oscillating  voltage  across  the  detector  at  us 
teiminals.  Due  to  the  nonlinear  characteristics  of  the  detector, 
a  signs  with  a  dc  component  proportional  to  the  square  of  the 
amplitude  of  the  incident  field  h  developed  at  the  detector.  This 
signal  is  filtered,  and  the  ir  component  is  conveyed  over  the 
transmission  line  to  the  monitoring  insttumenution.  Thus,  a 
signal  proportional  lo  the  square  of  the  amplitude  of  the  incident 
field  is  measured. 

In  the  brief  analysis  that  follows,  the  simplified  probe  of  Fig. 
2(a)  is  used.  The  lumped  element  shaping  and  filtering  networks  sre 
not  included  in  this  probe,  the  lossy  transmission  line  connecting 
the  detector  to  the  monitoring  instrumentation  provides  the  low- 
pass  filtering  for  the  detector.  The  incident  continuous-wave 
electric  field,  for  simplicity,  is  assumed  to  be  parallel  to  the 
axis  of  the  dipole: 

£,'(?.  I)  =  £,'(r)  cos  (cor  +  ♦,(?)!  z 

-Re  (E/(?. rd^'H.  U> 

where  Re  indicates  the  teal  part  and  bold  type  indicates  a  phasor 
quantity.  A  more  general  field  will  be  considered  later. 

The  incident  electric  field  generally  is  not  uniform  along  the 
axb  of  the  dipole  antenna  (z  axis).  To  provide  spatial  resolution 
of  the  field,  the  antenna  is  often  made  physically  short  and  elec¬ 
trically  short,  0gh  —  2rthf\y  <  I .  where  h  is  the  half-length  of  the 
dipole  and  is  the  wavelength  in  free  space  1  The  voltage  across 
the  terminals  of  the  electrically  short  dipole  when  they  are  open 
circuited  is  spproximaicly  proportional  to  the  incident  electric 
field  at  the  center  of  the  dipole  (the  otigin  O  in  Fig.  I): 

K«(u)m  *E,'(0,to).  (2) 

and  the  impedance  of  the  driven  dipole  Is  approximately  capaci- 
livt: 

ZA  (u)  -  -UuCA  «  -lit  |ln  (*/«4  )-l|  /*M.  (J) 

where  aA  is  the  radius  of  the  dipole  conductor  and  to  •*  •*>*  tm- 

t  Spiral  resolution  tl  determined  by  the  variation  of  the  modern  elec- 
Irk  held  over  the  length  of  the  dipole.  I  oi  hi»h  rewKitron.  Ihe  length  of 
the  dipole  must  be  uniU  compared  to  Ihe  distance  ovei  whack  the  g/ed-nt 
of  the  electric  Tic  id  it  significant. 
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Fit- 1.  Schwiik  of  wipini  probs. 


T»* 


Is  I  ft)  Simplified  probe  (b)  Equivalent  circuit  for  probe  with  diode 
oe  thermocouple  dcloclor. 

pedance  of  free  ipacc  [II],  Theie  two  element!,  the  open  circuit 
voltage  and  the  antenna  Impedance,  form  the  Thevenin  equivalent 
circuit  for  the  receiving  dipole  ihcwn  on  the  left  of  Fig.  2(b). 

Detector 

The  detector  in  the  probe  it  often  an  unbiaied  point  contact 
or  Schottky  barrier  diode  operating  in  the  square-law  region,  or  ■ 
thermocouple  junction.  In  Fig.  2(b)  the  equivalent  circuit  for  the 
diode  Is  divided  into  high  and  low-frequency  aectioni.  The  high- 
frequency  lection  contiiti  of  the  junction  resistance  Rt  and 
capacitance  C/,  and  the  series  resistance  /?,,  the  parasitic  elements 
saociated  with  the  packaging  of  the  diode  are  omitted.  The  low- 
frequency  circuit  contains  s  voltage  sour;e  and  the  video 
resistance  ft,  *  R,  +  Rf.  For  square-law  operation,  the  voltage 
source  is  proportional  to  the  time  averag;  of  the  radio-frequency 
power  absorbed  by  the  diode  (i2|,(l3]: 

v4-l4>e.  («) 

where  y4  it  the  voltage  tenutivity  of  the  diode. 

For  a  thermocouple  detector  the  hot  junction  of  tha  thermo¬ 
couple  with  a  aeries  resistance  R,  is  placed  scrota  the  terminals 
of  the  dipole;  see  the  inset  in  Fig.  2(a).  The  series  resistor  may  be 
a  thin  Aim  of  evaporated  metal  about  1000  A  thick  forming  the 
hot  junctk.1.  The  radio-frequency  current  throuih  the  resistor 


dissipates  power  f,;  this  raises  the  temperature  c  f  the  hot  junc¬ 
tion  Th  above  that  of  the  cold  junction  Tc  and  |  roduces  a  ther¬ 
moelectric  voltage  Kr  that  is  approximately  prop  irtional  to  the 
lime-average  power  dissipated  in  the  resistor; 

Vt  “  eiTn  ~Tc)  *7r  A-  (5) 

where  a  is  the  See  beck  coefficient  for  the  particular  combination 
of  materials  used  in  the  thermocouple.  The  high-frequency  sec¬ 
tion  of  the  equivalent  circuit  for  the  thermocouple  detector  con¬ 
sists  of  the  resistor  R,  with  possibly  a  parallel  capacitance  C,  to 
account  for  a  change  in  the  geometry  at  the  junction.  The  low- 
frequency  circuit  contains  the  thermoelectric  voltage  source  V, 
and  the  series  resistor  R,t.  Note  that  due  to  frequency  depend¬ 
ence,  the  resistance  R,  t  may  not  equal  R,- 

Retiithe  Tnnpninion  Line 

The  transmission  line  connecting  the  detector  to  the  monitor¬ 
ing  instrumentation  hat  an  interna)  resistance  [er  unit  lengths' 
for  each  conductor  and  a  capacitance  per  unit  length  e.  The 
resistance  pet  unit  length  is  usually  chosen  to  be  much  greater 
than  the  inductive  reactance  per  unit  length  ( 2 /  >  w f).  making 
the  characteristic  impedance  and  wave-number  on  the  line  ap¬ 
proximately 

Z.( to) - R,  +/J T,  *  0  -/).  (6) 

and 

*i(w)  •>  y/Fuc  (1  -/).  (7) 

in  addition,  the  length  of  the  line  is  selected  so  that  the  attenua¬ 
tion  of  a  wave  propagating  over  it  will  be  large  ( I  exp  ( ~iktt )  I  < 
I)  at  all  of  the  radio  frequencies  ot  interest.  The  high  resistance 
per  unit  length  of  the  transmission  line  produces  three  effects:  it 
reduces  the  direct  reception  of  the  incident  Field  by  the  line,  it 
reduces  the  scattering  of  the  incident  Field  by  the  line,  and  it 
makes  the  line  behave  as  a  low-pan  Filter  [14], 

The  transmission  line  can  behave  at  a  receiving  antenna  for  the 
incident  field  and  produce  a  signal  at  the  detector;  this  will  cause 
the  Field  pattern  for  the  probe  to  differ  from  that  of  the  short 
dipole.  The  principal  perturbations  introduced  are  a  shift  in  the 
position  of  the  nulls  in  the  elevation  pattern  for  the  dipole  and  a 
response  to  ei-ctric  fields  orthogonal  to  the  axis  of  the  dipole 
(14] .  For  a  lossy  two-wire  line  v  ith  a  conductor  spacing  b.  the 
relative  distortion  in  the  dipole  pattern  for  a  plane  wave  incident 
is  approximately  proportional  to  the  dimensionless  parameter 

X  -  — (6/»Xf%/2',»)  («) 

No’e  that  the  parameter  x  is  quadratic  In  the  dipole  length  A, 
but  linear  in  both  the  spacing  b  and  the  resistance  per  unit  length 
(I /s')  of  the  transmission  line  conductors.  Thus,  the  relative  dis¬ 
tortion  is  kept  fixed  while  halving  the  dipole  length  either  by  de¬ 
creasing  the  conductor  spacing  by  a  factor  of  four,  or  by  increas¬ 
ing  the  conductor  resistance  by  a  factor  of  four. 

When  the  probe  ‘s  used  in  a  multifrequencv  environment  with 
widely  separated  frequency  components,  the  reception  by  the 
transmission  line  may  not  be  negligible  at  all  of  the  frequencies. 
As  an  example,  consider  a  probe  being  uwd  to  measure  a  high 
radio-frequency  signal  in  close  proximity  to  an  electronic  device. 
Any  low-frequency  ac  fields  produced  by  the  device  (interference 
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for  th*  probe)  »a  votpk  to  Dm  piobe's  l/ansmisssoo  Use.  The 
filtering  Kltcn  rt  ituUm  lint  may  not  be  sufficient  at  the 
low  frequenc;  .pprex  the  propagation  of  the  taterferance 
cm  the  trimm  <s.on  line  to  the  detector  end  to  the  monitoring 
InUrumenUtkm. 

▼he  curie nt  induced  in  the  uinuniuiofl  line  by  the  Incident 
fleiu  '.-vJ'tcei  e  secondary  or  tceltered  field  which  also  miy  be  e 
•ource  '(  error  in  the  metn.--tme.il.  The  reduction  in  the  tut- 
-er *2  i.;M  that  results  from  the  ute  of  resistive  conductor!  it 
illustrated  i>i  Fig.  3(a).  where  the  normaitztd  total  scattering  cross 
tecta rc  o!  the  line  Og/Xj  i»  ihown  as  t  function  nl  the  nor- 
me../*.',  '...stance  per  unit  length  R  ■  e'Wfe-  The  incident  field 
is  e  p'sne  wave  with  the  electric  field  parallel  to  the  conductors, 
and  th:  line  is  one  wavelength  long  r/Xo  “  1.0.  The  two  regions 
marked  on  the  graph  represent  typical  resistaves  at  a  frequency 
of  I  GHz  for  a  round  carbon-Teflon  conductor  developed  by  the 
National  Bureau  of  Standards  (at  **  0  38  mm,  R  m  65.6  ICf 1/m) 
and  thin  metallic-film  conductors  (r*  «  1  -  10 Mfl/m)  (5) .  {6} . 
From  this  graph,  it  is  clear  why  the  highly  resistive  transmission 
lines  are  often  referred  in  as  “transparent"  to  electromagnetic 
fields  at  high  radio  and  i.ikrowave  frequencies. 

A  transmission  line  with  a  high  resistance  per  unit  length  it 
very  dispersive.  This  is  illustrated  in  Fig.  3(b)  where  the  voltage 
transmission  ratio  l'fiyT(O)  is  shown  as  a  function  of  the  fre¬ 
quency  for  20  cm  long  lines  made  from  carbon-Teflon  and  thin 
film  conductors.  The  tiansmistion  line  is  seen  to  behave  as  a  low- 
pass  filter  with  little  distortion  occuring  for  signals  with  fre¬ 
quencies  below  the  point  where  |*t«|  **  1.0.  In  the  high-fre¬ 
quency  equivalent  circuit  of  Fig.  2(b).  the  input  impedance  of  the 
transmission  line  appears  across  the  diode:  for  a  line  with  high 
loss  ( | exp  (~jkLt)  |  <  I ).  this  is  approximately  the  characteristic 
impedance  Zc.  In  the  low-frequency  circuit,  the  transmission  line 
it  represented  by  a  “Pi"  low-pass  filter  network. 

Probe  Rapoiue 

The  response  for  the  probe,  i.e..  the  voltage  Vm  across  the 
input  impedance  Z„  *•  Rm  +  iXm  to  the  instrumentation,  is 
easily  determined  from  the  equivalent  circuit  in  Fig.  2(b).1  The 
series  resistance  R,  is  set  equal  to  aero  since  it  is  often  much 
smaller  than  the  junction  impedance;  the  response  for  a  probe 
with  a  diode  detector  is  then 


r*  ~ 

-  1= 


Fig.  3.  Chincicrislki  of  toay  bifimWon  S*ei  (a)  Normalized  (ftil 
acjtttting  Croat  tcction  of  a  ona  wavelength  ten*  Una  i/a0  •  |,0  at  a 
function  of  the  normalize*  rrutianct  per  unit  length,  fb)  The  trait* 
motion  tine  at  a  low-put  fitter.  •  •  20  cm.  c  *  20  pF/m. 

with  the  diode.  With  that  approximation  and  (2*.  (9)  ttmphfiea 
to  become 

“C|E/(0.  w)|*  (10s) 


_ _ *„(w*,C,),y,>|r.c|* _ 

«/(*«.+*.  +  2r'*Kll  +  *,*t/|Zcl,J*  +  [w*,(C4  +  C,)  -*,jrt/|Zt|,)»7 


Tlx  same  expression  applies  to  a  probe  with  a  thermocouple 
detector  when  Rr  C,.  /?..  and  fa  are  replaced  by  R,.  C,.  Rl9. 
and  7(.  Note  that  the  capacitance  of  the  elements  in  the  “Pi" 
network  of  Fig.  2(b)  anu  the  reactance  of  the  impedance  Zm  do 
not  appear  in  (9),  since  a  dc  signal  is  detected  when  the  incident 
field  is  a  continuous  wave. 

The  lossy  transmission  line  is  usually  designed  to  have  in  input 
impedance  that  is  laige  compared  to  the  impedance  of  the  diode 
(IZ,  |  >  Rj).  If  this  requirement  cannot  be  satisfied  by  the  trans¬ 
mission  line  alone,  a  lumped  series  resistance  can  be  added  to 
each  conductor  of  the  ime  to  increase  the  itipedance  in  parallel 

1  A  complete  dimiuwn  of  the  probe's  rnpone.  incbdlni  Itie  effecti  of 
pustule  dementi  In  the  diode  eqomtent  circuit,  h  |i*«n  In  |  II,  ch.  J|. 


We -l*/(C,  ♦<,)}-•.  <IOb) 

Each  bracketed  teim  in  this  equation  is  associated  with  elements 
in  the  rquivsient  circuit.  Fig.  2(b).  The  first  -.im  accounts  *ir  the 
frequency  dependent  division  of  the  antenna's  open-circuit 
voltage  i'of  between  the  snt-une  impedance  end  the  diule  im¬ 
pedance.  Hie  second  teim  is  .ne  time-aversgc  of  the  ndio-fre- 
quency  powei  delivered  to  the  diode  when  the  diode  impedance 
is  luge  compared  to  the  antenna  impedance.  The  last  teem  rep¬ 
resents  the  division  of  the  detected  dc  voltage  between 
the  video  lesistance,  the  transmission  line  lesistance  and  the 
resistance  of  the  monitoiing  in.trumenlanon. 

The  behavior  of  the  icsponse  with  frequency  u  shown  in  Fig. 
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4;  it  it  k«r  to  have  two  distinct  regions.  Below  the  frequency  ui, 
the  tcsponte  approaches  an  asymptote  which  decreases  as  the 
square  of  the  frequency  (-12  dB/octave).  and  above  <JC  the 
response  approaches  an  asymptote  which  is  independent  of  the 
frequency.  Thu  frequency  dependence  is  easily  understood  by 
examining  the  voltage  division  between  the  elements  in  the  high- 
frequency  equivalent  circuit  of  Fig.  2(b)  and  is  discussed  in  detail 
in  Section  lll-A. 

When  the  frequency  is  increased  beyond  the  point  where  the 
dipole  antenna  u  electrically  short  (fioh  2  0  3).  the  elements  in 
its  equivalent  circuit  differ  from  the  values  in  (2)  and  (3).  and  the 
response  it  no  longer  given  by  (10a)  A  typical  response  for  the 
electrically  longer  antenna  a  shown  as  a  dashed  line  in  Fig.  4. 
and  it  it  seen  to  peak  in  the  vicinity  of  the  frequency  for  the  first 
resonance  of  the  dipole  tJ,  (at  or,,  do*  *  */‘)  The  response  a 
relatively  flat  over  the  frequency  range  which  extends  from  about 
2  m,  to  0  3m,.  In  practical  designs,  the  resonant  frequency  m, 
it  determined  by  the  length  of  the  dipole,  and  the  frequency  m« 
it  changed  mainly  by  adjusting  the  junction  resistance  R,.  Increas¬ 
ing  Rt  extendi  the  region  where  the  response  is  flat  to  lower  fre¬ 
quencies;  however,  an  increase  i.  also  decreases  the  sensitivity 
of  the  probe  (the  output  I  Vm  1 1  '  a  fixed  field  |  Ej  I)  unless  the 
input  resistance  of  the  transmission  Rm  is  large  compared  to  the 
junction  resistance.  The  factor  y«  is  approximately  proportional 
to  R and  R,  <•  Rt;  thus,  the  response  (10a)  it  proportional  to 
the  frequency  independent  factor  Rm/{Rm  +  */  +  >'r|  when 
M  ►  mc.  A  change  us  the  junction  resistance  Ri  wjl  not  affect 
the  sensitivity  of  the  probe  provided  Rm  ♦  2 r't  >  R/- 

The  preceding  analysts  is  for  a  continuous-wave  incident  field. 
The  response  of  a  probe  with  a  diode  detector  to  an  amplitude 
modulated  incident  field  will  be  limitir  to  (10a).  provided  the 
frequencies  in  the  square  of  the  modulating  signal  are  within  the 
pass  band  of  the  tow-pass  filter  formed  by  the  lossy  tisnsmisuon 
line,  le„  |*ts|  •<  1  at  these  frequencies.  A  factor  /’(/).  where 
)[t)  it  the  modulating  signal,  mutt  be  included  in  (10a)  in  this 
event.  For  the  result  in  (IQs)  to  apply  to  the  measurement  of  tn 
amplitude  modulated  field  with  a  thermocouple  detector,  the 
thermal  time  constant  of  the  detector  alio  mutt  he  short  com¬ 
pared  to  the  period  of  the  highest  frequency  contained  in  the 
modulation. 

hotnpkhobt 

Consider  the  general  monochromatic  incident  field  expressed 


fc)  <« 

Fit.  5-  (a)  Thi««  c  thofoml  dipokt  with  a  common  c cater  (b),  (c).  (d) 
Practical  irrattmcati  of  tfercc  orthogonal  dgoki  wait  daplacad, 
can  ten. 

as  the  sum  of  (lure  orthogonal  components  each  of  the  form  (1 ): 
£'(e.0-  2  (f)  cos  {ur +  4.(?)1 1? 

-  Re  £  2  (r* 

-  Re  (E,(f.M)e'“').  (12) 

Each  of  the  three  orthogonal  dipole  probes  shown  in  Fig.  5(c), 
when  placed  in  this  field,  will  have  a  response  proportional  to 
the  square  of  the  amplitude  of  s  field  component: 

l*'«l.-C|Ei(0.w)|\  it  **.)'.*.  (13) 

Aftet  the  three  rtiponses  ere  summed,  a  signal  proportional  to 

the  squire  of  the  Hermitian  magnitude  of  the  complex  vector 
field  E'(0,w)  is  obtained  (I5|: 

2  IT.I.-C2  IE'(0.w)l*-C|£<(0.w)|*.  (14) 

m  m 

This  signal  it  independent  of  the  orientation  of  the  probe  with 
teipect  to  the  field;  thus,  the  responses  of  the  probe  composed 
of  the  three  orthogonal  dipoles  is  isotropic.  Note  that  the  Her- 
mitiso  magnitude  of  the  complex  field  is  an  upper  bound  on  the 
instantaneous  field; 

|£'(0./)|<|E<(0.m)|.  (15) 

In  practical  field  probes,  the  centers  (terminals)  of  the  three 
orthogonal  dipoles  generally  ere  not  coincident  as  in  Fig.  5(a). 
but  they  are  displaced  from  each  other  is  in  the  configurations 
shown  in  Figs.  5(b),  (c).  and  (d)  (4j.  (5|.  |7j.  Each  dipole  in 
Fig.  5(b)  is  mounted  on  s  planar  substrate;  two  of  the  dipoles 
nuke  an  angle  of  45*  and  the  third  an  sngle  of  90*  with  the  long 
axis  of  the  substrates.  When  the  three  substrates  arc  combined  in 
the  “I-beam"  configuration  the  Jtpoiei  ara  orthogonal.  In  Fig. 
5(c),  the  d-polet  are  placed  on  planar  substrates  that  are  com¬ 
bined  to  form  i  tube  whole  cross  section  is  an  equilateral  trunglc. 
Each  of  the  orthogonal  dipoles  mskes  tn  acute  angle  of  54.74 
with  the  axis  of  the  tube.  The  three  of  the  orthogonal 
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dipoles  In  Fl|.  5(d)  in  along  the  lateral  edges  of  a  regular  pyra¬ 
mid  whose  base  to  an  equilateral  triangle.  Each  of  the  dipoles 
makes  an  acuta  angle  of  54.74*  with  the  altitude  of  the  pyramid, 
and  the  handle  of  the  probe  Is  an  extension  of  the  altitude.  When 
the  centers  of  the  dipoles  are  displaced,  at  In  Figs.  5(b),  (c).  and 
(d),  each  probe  measures  a  field  component  at  a  different  posi¬ 
tion  In  space.  The  three  components  can  be  combined  to  estimate 
the  Hermltian  magnitude  of  the  field  at  a  tingle  point  (14)  only 
if  the  field  is  assumed  not  to  easy  over  the  volume  of  space  occu¬ 
pied  by  the  dipoles.  If  the  dipoles  are  physically  and  electrically 
short,  this  volume  will  alto  be  physically  and  electrically  snail, 
and  the  assumption  of  a  uniform  field  within  the  volume  is 
justifiable. 

When  the  electric  field  probe  is  used  in  inhomogeneous  mate¬ 
rial  media,  such  at  biological  tissue,  the  normalized  response  of 
the  probe  I  Vm  |/|Ej  I1  may  vary  with  position.  For  an  electri¬ 
cally  short  dipole,  this  variation  is  mainly  due  to  the  change  in  the 
impedance  of  the  antenna  ZA  with  a  change  in  the  electrical  con¬ 
stitutive  parameters  of  the  material  surrounding  the  probe.  The 
variation  can  be  reduced  by  making  the  antenna  impedance  small 
compared  to  the  impedance  of  the  detector  (CA  >  C/  in  the 
region  where  the  resonse  is  flat)  or  by  minimizing  the  variation  in 
the  antenna  impedance  by  insulating  the  dipole  (11).  (16) , 
(17) .  The  insulated  dipole  is  formed  by  coating  the  antenna  with 
a  material  whose  relative  dielectric  constant  is  lower  than  that 
of  the  surrounding  medium  e,.  The  impedance  of  the  insulated 
dipole  it  fairly  insensitive  to  variations  in  the  electrical  param¬ 
eters  of  the  surroundings;  this  is  illustrated  in  Fig.  6,  where  the 
capacitance  of  a  paiticular  electrically  short  insulated  dipole  is 
seen  to  have  kttle  variation  with  e,  once  r,/t„  2  S. 

111.  IMPLEMENTATION 

A.  h -nix  with  Diode  Deletion 

The  basic  detign  of  this  device  has  been  optimized  by  the  US. 
National  Bureau  of  Standards  (NBS)  (10)  for  coverage  over  the 
frequency  ranges  0.2  to  1000  MHz.1  Square-law  response  (output 
voltage  proportional  to  lE'l1)  over  the  range  from  about  1  V/m 
to  2000  V/m  (rmsj  it  provided  by  using  electronic  circuitry  which 
compensates  for  the  nonsquare-law  response  of  the  diode  detector 
at  high  output  voltage  levels  (greater  than  about  2$  mV).  An 
array  of  three  orthogonal  dipoles,  each  of  total  length  2A  »  I  cm, 
is  used  to  provide  isotropic  response.  This  is  achieved  by  placing 
each  dipole  on  a  dielectric  substrate,  and  combining  three  of  the 
substrates  to  form  a  triangular  tupport  frame  and  handle,  as  in 
Fig.  5(c). 

The  use  of  a  beam-lead  carrier  package  for  the  Schottky  diode 
chip  minimizes  detector  parasitic  inductance,  to  provide  a  flat 
nonresonant  frequency  response  over  a  wide  portion  of  the  RF / 
microwave  frequency  range.  A  high  resistance  transmission  line 
(low-pan  filter)  it  formed  by  a  sandwich  arrangement  which  uses 
two  flat  thin  carbon  impregnated  Teflon  strips,  with  a  resistance 
per  unit  length  of  about  4  Mfl/m,  attached  to  a  thm  insulating 
double-tided  adhesive  tape.  This  probe  is  used  in  free  space  for 
both  radiation  hazard  measurements  and  for  electromagnetic 
compatibility  measurements,  with  a  standard  shielded  cable  or 

1 A  tomirunU  firm,  Hobday  InduitiKi.  Edina.  AN.  has  adapted  the 
NBS  design  by  adding  resistance  lo  the  dipole  clemenu  and  using  a  glats- 
packagtd  diode.  The  resultant  device  covers  the  frequency  range  0.5  to  4000 
MKa. 


point  in  mediim  with  rehitve  dielectric  constant  «„  kft  ■  1 1,  t/a  •  1J, 
•rim  J.l.k/Lg-J.7  X  10-5. 


another  lossy  transmissioii  line  connecting  the  probe  to  an  elec¬ 
tronic  "readout  box." 

The  National  Center  for  Devices  and  Radiological  Health  (for¬ 
merly  the  Bureau  of  Radiological  Health  (BRH))  and  its  contrac¬ 
tors  have  developed  miniaturized  venions  of  the  above  probe 
using  thin-film  technology.  The  intended  application  of  these 
miniature  E-field  probet  ia  primarily  for  implantation  in  living 
animals  or  models  used  In  biological  effecla  studies  (18).  These 
probes  ire  constructed,  as  shown  in  Fig.  7,  from  a  diode  chip 
with  integral  beam  leads  and  a  dipole  antenna  of  total  length 
2k  ■  1.5  mm.  The  conductors  of  the  lossy  parallel  wire  trans¬ 
mission  line  ere  msde  of  highly  resistive  thin-film  mileriel.  Each 
dipole  probe  is  mounted  on  a  dielectric  substrate,  snd  three 
substrates  are  combined  U  form  a  tube  of  triangular  cross  sec¬ 
tion,  as  shown  in  Fig.  5(c).  T>.e  array  of  three  dipoles  forms  an 
iaotroplc  probe.  The  tip  of  the  probe  is  encapsulated  in  t  dielec¬ 
tric  material.  The  substrate  snd  the  enctpsulitkm  serve  is  insula¬ 
tion  on  (he  dipole  entennas  and  help  to  make  the  probe’s  re¬ 
sponse  independent  of  the  electrical  parameters  of  the  surround¬ 
ing  medium  when  the  probe  to  used  in  a  material  with  a  high 
dielectric  constant,  like  biological  tissue. 

Squire-law  detection  of  E-fields  to  accomplished  by  limiting 
the  RF  voltage  applied  to  the  diode;  this  to  i  fortunate  conse¬ 
quent «  of  making  the  electrical  length  of  the  dipole  very  small. 
Other  meins  of  shaping  the  delected  voltage  vetsui  RF  field 
strength  (such  as  the  use  of  electronic  circuitry,  either  enalog  or 
digital)  are  not  necessary. 
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Th«  diodes  in  the  probe  tit  selected  so  (hit  their  junction 
resistance  R,  (sec  the  equivalent  circuit  of  Fig.  2(b))  is  large 
enough  to  appear  as  an  open  circuit  in  comparison  to  the  antenna 
impedance  2 4  at  h:gh  radio  frequencies  (w  >  u{)  The  total 
open-circuit  voltage  of  the  antenna  Kac  then  appears  across  the 
diode,  and  the  response  of  the  probe  with  frequency  is  flu.  as  seen 
Fig  4.  Since  the  antenna  impedance  is  primarily  capacitive  (the 
capacitance  is  about  0.1-0  2  pF),  it  increases  with  decreasing  fre¬ 
quency.  and  at  low  radio  frequencies  eventually  is  greater  than 
the  diode  impedance.  The  open  circuit  voltage  of  the  antenna  is 
then  divided  between  the  antenna  impedance  and  the  diode  im¬ 
pedance.  and  the  response  of  the  probe  decreases  with  decreasing 
frequency  (the  - 12  dB/actave  decrease  that  occurs  at  frequencies 
below  u,  in  Fig.  4).  The  larger  the  value  of  the  junction  resist¬ 
ance.  the  lower  the  frequency  ue  at  which  the  probe  response 
begins  to  roll  off.  The  junction  resistance,  however,  cannot  be 
increased  indefinitely,  since  the  video  resistance  of  the  diode 
R.  increases  with  the  junction  resistance  ( R ,  *  R,  +  R,).  When 
the  video  resistance  becomes  large  compared  to  the  resistance  of 
the  metering  instrumentation  Rm,  an  insufficient  amount  of  the 
detected  voltage  will  appear  across  the  instrumentation.  The 
net  result  is  that  the  "high-impedance  aero-bus"  diodes  or 
"medium  barrier"  Schollky  diodes  in  the  minialuie  i'-fjeld  probe 
are  chosen  to  have  an  optimal  junction  resistance,  i.e..  one  large 
enough  to  produce  sufficient  bandwidth  (low  uc).  yet  not  so 
high  at  to  significantly  decrease  the  sensitivity  (At,  not  large  com¬ 
pared  tort.,)  Use  of  the  optimal  detector  diode  provides  a  flat 
frequency  response  for  the  probe  over  the  range  from  1 00  MHz 
to  beyond  12  GHz  and  a  sensitivity  which  enables  measurement 
of  field  strengths  of  a  few  V/m. 

The  lossy  transmission  line,  which  is  about  30  cm  long  in  this 
piobe.  it  connected  so  a  telemetry  system.  This  system  contains 
a  preamplifier  that  drives  an  analog  10  digital  converter  (voltage- 
controlled  oscillator):  the  output  of  the  converter  modulates  a 
light  emitting  diode  producing  optical  data  pulses  that  are  trans¬ 
mitted  over  a  fiber  optic  to  a  remote  readout,  see  Fig.  *  [I9|. 
The  battciy-operated  three  channel  telemetry  system  is  housed  in 
a  metal  cube  with  sides  of  approximately  3  cm  in  length.  Scat¬ 
tering  errors  introduced  by  the  telemetry  unit  are  less  than  about 
0.23  dB.  when  the  probe  is  in  free  space. 

A  recent  paper  describes  a  broad-band  (200  KHz-26  GHz) 
probe  with  diode'detectors  developed  by  a  commercial  firm  |9]  * 
The  three  dipoles  in  this  probe  are  each  of  length  2*  *  3.2  cm. 
Resistive  strips  carry  the  detected  signal  from  the  Schottky- 
barrier  diodes  at  the  terminals  of  the  dipoles  to  the  monitoring 
Instrumentation  The  broad-band  response  of  the  probe  is  ob¬ 
tained  by  making  the  dipoles  from  resistive  thin  film  and  includ¬ 
ing  a  shaping  network  at  their  terminals.  Note  that  the  dipoles 
of  thi.  probe  are  electrically  long  at  the  upper  frequencies  in  its 
specified  range  of  use  (2h  a  2.8X0  at  26  GHz).  Thus,  unlike  the 
electrically  short  dipoles  discussed  earlier,  the  response  of  this 
probe  will  be  a  weighted  average  of  the  field  over  the  length  of 
the  dipoles  whose  centers  may  be  displaced  by  a  few  wavelengths 
at  the  higher  frequencies. 

B  Frobet  with  Thermocouple  Deletion 

Several  isotropic  C-fleld  probes  with  thermocouple  detectors 
have  been  developed  for  use  over  consecutive  frequency  bands, 
which  when  combined  cover  the  entire  frequency  range  of  10 
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Fig  I.  £  -Field  probe  with  optical  telemetry  system. 


F«.  9.  (a)  Antenna  element  formed  from  thermocouple  array,  (b)  De- 

tad  of  thermocouples 

MHz  to  26  GHz  |3),  (4).'  In  one  probe,  three  antennas,  each 
being  several  centimeters  long,  are  arranged  in  the  orthogonal 
array  shown  in  Fig.  3(d).  For  broad  band  response  each  antenna 
ia  formed  from  many  antimony-bismuth  thermocouples  (detec¬ 
tors)  distributed  along  a  hairpin  curve,  sec  Fig.  9(a).  Note  that  the 
hot  junctions  of  the  thermocouples.  Fig.  9(b),  are  formed  by 
decreasing  the  cross  sectioa  of  the  conductors:  this  increases  the 
electrical  resistance  and  the  power  dissipation  at  these  points. 
The  resulting  sntcnna/deteclor  of  this  probe  is  relatively  ineffi¬ 
cient  and  does  not  perturb  the  field  being  measured.  The  low 
sensitivity  is  compensated  for  by  providing  a  low-noise  pre¬ 
amplifier  in  the  handle.  A  parallel  wire  transmit sion  line  with 
tapered  resistive  thin-film  conductors  connects  the  antennas  with 
the  pteamplifier. 

Each  array  of  thermocouple  elements  in  this  probe  is  not  elec¬ 
trically  short  at  the  upper  end  of  the  frequency  range  specified 
for  its  use.  Thus,  this  probe  will  not  necessarily  give  in  accurate 
indication  of  the  electric  field  at  a  point,  but  a  weighted  average 
of  the  field  ovei  the  length  of  the  array. 

The  primary  advantage  of  the  thermocouple  probe  Is  Its  In¬ 
herent  ability  to  integrate,  via  thermal  means,  pulsed  high  level 
F-fieldt,  such  as  those  tn  the  vicinity  of  a  radar  transmitter.  The 
short  duration  of  these  pulses ( mkrroKCondt)  precludes  the  use  of 
dipole/dtode  probes.  This  is  due  to  the  wide  bandwidth  of  the 
pulses  compared  to  the  narrow  bandwidth  of  the  resistive  tians- 


4  GfMnI  Miemwave  Cory,  Farmlngdah,  NY  1 1 723. 


*  Narda  Mkrowsvv  Corp,  Hsvppaugt,  NY  11714. 
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F»  10.  £-flald  probe  with  setm  electronics. 


mission  line  (low-pen  filter)  which  mutt  ctrty  the  detected  signal 
from  the  diode  to  the  processing  circuitry  where  inte(rition 
would  occur. 

C  Probet  with  Active  Detector  Circuitry 

Probet  that  have  active  detection  circuitry  at  the  terminal*  of 
the  dipolet  were  developed  by  the  US.  National  Bureau  of 
Standard*  (20),  (21).  A  commercial  tyttem  is  now  available  tot 
use  in  the  frequency  ranee  0.01  to  220  MHz*  The  active  cir¬ 
cuitry,  a  high  impedance  RF  amplifier,  it  used  to  properly  ter¬ 
minate  the  short  monopoles  (less  chan  10  cm  in  length)  and 
obtain  a  flat  frequency  response  in  the  low-frequency  portion  of 
the  usable  range.  In  this  device,  orthogonal  monopoles  are  used  to 
obtain  isotropic  response.  The  active  circuitry ,  batteries  and 
readout  meter  are  housed  in  a  cube  with  sides  10  cm  in  length. 
The  metal  cube  also  serves  as  a  quasi-image  plane  for  Ih*  mo¬ 
nopole  antennas,  as  shown  in  Fig.  10.  The  above  sysiem  also  in¬ 
cludes  a  fiber  optic  data  link  from  the  cube  to  the  remote  instru¬ 
mentation.  The  fiber  optic  link  is  driven  by  a  voltage-controlled 
oscillator  and  a  light  emitting  diode  housed  in  the  metal  cube. 
Good  linearity,  frequency  response,  and  antenna  patterns  have 
been  obtained  in  laboratory  tests  of  this  device  over  the  fre¬ 
quency  range  10  to  100 MHz  (22) . 

D.  Summary  of  E-Field  Probe  Performance  Parameter! 

Table  I  presents  data  on  the  critical  parameters  of  the  probes 
which  were  discussed  previously;  all  of  the  probes  use  three 
orthogonal  elements  to  obtain  isotropic  response. 

IV.  APPLICATIONS  OF  f-FIEt  0  PROBES 

The  primary  application  of  f-field  probes  Is  in  the  assessment 
of  radiation  hazards.  United  States  RF/microwave  safety  stand¬ 
ards  limit  human  exposure  st  frequencies  ranging  from  300  kHz 
to  100000  MHz  to  levels  ranging  from  CO  V/m  to  600  V/m 
(rtns).  Fo:  most  radiation-safety  surveys,  a  probe  with  an  array 
of  three  small  dipoles  Is  used,  since  the  polarization  of  the  fields 
being  measured  is  unknown,  particularly  for  complex  near-field 
radiation  situations  (23|.  The  uncertainty  of  measurement  for 
this  type  of  probe  when  all  sources  of  error  are  considered  can 
approach  t2  UB  (221 .  (24) . 

*  taurammu  fer  Industry,  Fanalnadate,  NY  I171J. 
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f-field  probes  have  been  used  frequently  for  external  (ex¬ 
posure)  field  mapping  in  biological  effects  studies  involving 
animals  that  are  purposely  exposed  to  electromagnetic  fields 
(2S) .  Miniature  implantable  f-field  probes  also  have  been  used 
in  several  studies  to  internally  probe  living  or  sacrificed  animals  to 
ascertain  the  electric  field  in  specific  organs  when  the  animal  is 
exposed  to  RF/microwave  radiation  (18).  The  measurement  un¬ 
certainty  associated  with  the  implantable  f-field  probe  may 
approach  12  to  13  dB  if  the  relative  dielectric  constant  of  the 
medium  is  above  S. 

With  the  implantable  f-field  probe,  an  animal  b  typically  ex¬ 
posed  to  electromagnetic  radiation  with  a  power  density  of  I  to 
10  mW/cm5.  The  3  mm  diameter  probe  tip  measures  the  three 
orthogonal  f-field  vector  components  at  die  selected  site.  In¬ 
ternal  dosimetric  methods  other  than  the  f-field  probe  have 
almost  universally  been  thermal  in  nature,  involving  exposures  of 
an  animal  to  intense  radiation  (>  100  mW/cm*),  followed  by 
measurements  of  the  temperature  rise  at  various  points  in  the 
animal  (26) .  The  electric  field  is  computed  from  the  tempera¬ 
ture  rise  by  using  the  relationship: 

SARw  l  ol§|,/P*,d~  (16) 

where 

SAR  Specific  Absorption  Rate  (W/kg). 

|E|  Hermitian  magnitude  of  the  internal  peak  electric  field 
(V/m). 

O  Effective  electrical  conductivity  of  the  tissue  at  the 
poir.t  of  measurement,  and  at  the  frequency  of  the  ex¬ 
posure  field  (S/m). 

p  Mas*  density  of  the  tissue  (kg/m*). 

AT  Temperature  rise  (*C). 

At  Duration  of  exposure  (s). 

c  Specific  heat  of  the  tissue  (J/kg*C)- 

From  the  above  equation  H  b  easily  teen  that  the  measurement 
of  the  electric  field  strength  in  a  biological  specimen  using  a 
ttw.T" probe,  as  compared  to  a  well  designed  f-field  probe,  b 
much  more  involved  and  requires  much  more  specific  information 
about  the  exposure  and  the  electrical  and  thermal  parameters  of 
the  tissue  at  the  point  of  measurement.  Conversely,  measurement 
of  the  spec  if  t  absorption  rale  with  an  f-field  probe  requires 
knowledge  of  the  conducthity  and  the  density  of  the  tissue. 

f-field  probes  have  also  been  used  to  map  the  near-field  of 
RF/microwave  emitting  therapy  devices,  such  as  microwave  and 
shortwave  diathermy  systems  (used  to  treat  muscular  and  connec- 
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live  limit  injuries)  and  microwave  hyperthermia  aytlems  for 
cancer  treatment  [27] .  Measurements  have  been  made  of  the  in¬ 
dividual  vector  components  of  the  £-field  which  lie  in  a  plane 
parallel  and  very  close  (S  to  2 $  mm)  to  the  aperture  of  a  dia¬ 
thermy  applicator  |28] .  These  results  are  used  to  predict  the 
relative  efficiency  of  the  applicator  and  the  pattern  of  energy 
deposition  in  planar  models  of  the  human  tissue. 

Another  application  of  f-field  probes  is  assessing  the  electro¬ 
magnetic  compatibility  of  electronic  devices.  Fot  example.  E-field 
probes  are  used  to  monitor  the  field  strengths  in  the  proximity 
of  electronic  devices  which  are  susceptable  to  RF  interference, 
such  at  sensitive  medical  monitoring  equipment  (electrocn- 
ccphalographic  (EEC)  devices). 

V.  FUTURE  ADVANCES  IN  E-FIELD  PROBE  TECHNOLOGY 

In  an  effort  to  reduce  the  sue  of  the  three-axis  implantable 
probe,  a  dipole/diode  integrated  circuit  is  being  developed  with 
individual  dipole  antennas  of  total  length  2 h  —  0.6  mm  {29|. 
Special  techniques  are  being  used  to  produce  a  diode  chip  that  is 
electrically  and  mechanically  compatible  with  the  electrically 
small  antenna.  The  production  of  an  isotropic  probe  containing 
three  dipoles  with  an  outer  diameter  of  1-2  mm  is  this  program's 
goal. 

An  electrically  small  dipole  may  be  coupled  to  an  optical 
modulator  to  that  the  RF  voltage  developed  by  the  antenna 
causes  a  direct,  instantaneous  change  in  the  amplitude  of  a  beam 
of  light  passing  through  the  modulator  (30) ,  see  Fig.  1 1(a).  This 
passive  technique  provides  isolation  of  the  probe's  antenna,  just 
at  the  high  resistance  lossy  transmission  lines  co  in  present  £• 
field  probes.  The  optical  technique  hat  the  additional  advantage 
that  It  also  provides  a  very  fast  response  lime  (less  than  one  cycle 
of  the  RF  field)  and  phase  measurement  capabilities.  The  lossy 
tiansmitsion  line,  with  its  inherent  low  past  filtration,  cannot 


provide  similar  performance.  The  system  shown  in  Fig.  ?!'b) 
uses  an  integrated  optica!  modulator  coupled  to  a  User  diode 
through  a  tingle -mode  optical  fiber.  Such  a  modulator  has  been 
designed  for  use  with  a  3  cm  korg  dipole  (31].  A  flat  frequency 
response  from  one  to  several  h  indeed  MHz  is  the  design  goal, 
with  reproduction  of  the  instantaneous  (RF)  waveform  occurring 
at  a  remote  site  where  the  optical  fiber  is  coupled  to  a  photo* 
diode  detector. 


VI.  CONCLUSION 

Electric  field  probes  have  been  developed  and  used  over  much 
of  the  RF/microwave  spectrum.  Probes  arc  commercially  avail¬ 
able  for  near  and  far  zone  isotropic  measurements  of  the  magni¬ 
tude  of  the  electric  field  both  in  free  space  as  well  as  in  material 
media,  such  as  within  living  animals  used  in  biological  effects 
studies.  In  complex  near-zone  fields,  only  those  probes  whose 
maximum  dimension  is  a  small  fraction  of  a  wavelength  can  be 
expected  to  give  a  reading  that  approaches  the  value  of  the  field 
at  a  point.  Radiation  hazard  and  electromagnetic  compatibility 
surveys  make  use  of  broad-band  f-field  probes  which  yield 
uncertainties  of  I  to  3  dB  when  used  in  complex  near  field  en¬ 
vironments.  New  technologies  will  improve  the  performance 
and  reduce  the  size  of  f-field  probes  and  will  enable  them  to  be 
applied  in  other  areas. 
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Directive  Properties  of  Antennas  for  Transmission  into  a 
Material  Half-Space 

GLENN  S.  SMITH.  Hunt  mlmur.  ike 
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llrnlin  praptrUw  af  Ike  imhm  Ipassee*  lUacttM,  Mia,  eeU  dlrectlv- 
R)f).  Nawerkal  main  an  gmeated  foe  Mrialtexlwai  ikrtik  aa4  aiag- 
aelk  barlaaalal  dipole  aaltaaaa  la  a  Ifekdric  keif -space,  nfka  I.  mkt 
llndln  traetwlseMe  laia  I  be  edjecvai  kkkclrk  ball  ifit,  rt|Ua  2, 
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leaaas  arar  aa  lairrlacc  bMana  air  iM  In ab  aalri  an  M  goad  agree- 
Mtal  wiab  Iba  Sheer?. 


I  INTRODUCTION 

nPHE  ANTENNA  radiating  in  the  presence  of  a  marer-al  half- 
space  has  been  the  subject  of  extensile  theoretical  investiga¬ 
tion.  beginning  with  the  famous  icsults  of  Sommerfeld  in  190V 
1 1 1 .  The  cases  treated  in  the  literature  can  be  described  with 
the  help  of  the  drawing  in  Fig.  I.  Here,  a  geneial  antenna  is  lo¬ 
cated  at  a  height  h  above  (he  planar  interface  separating  the 
homogeneous  material  half-spaces,  regions  I  and  2.  The  elec¬ 
trical  constitutive  parameters  for  the  regions  are  the  effective 
conductivity  o„.  the  effective  permittivity  e„,  and  the  perme¬ 
ability  N|  »  No  (noth  legions,  i  «  i,  2.  are  assumed  to  be  nonmag¬ 
netic).  For  a  harmonic  time  dependence  e'“',  the  complex  wave¬ 
number  and  the  complex  wave  impedance  in  cilhei  medium  are 


*(“0l“/«i“«o(Noti),/,i  4(^0 

<U) 

fi-oww*'*. 

<ib) 

wheie  (t  m  trl( |  -  fptl)  with  the  effective  loss  tangent  prt  « 
•rl/w* el- 

The  electromagnetic  field  of  the  antenna  in  the  presence  of 
the  half-space  is  conveniently  discussed  in  terms  of  Us  asymptotic 
expansion  for  large  radial  distance  (lim  ktr  —  •)  In  either  me¬ 
dium,  the  leading  term  in  the  expansion,  which  is  0(«xp(-/A(e)/ 
*)fj ,  is  referred  lo  as  the  space  wave,  this  is  the  "geometrical  op¬ 
tica"  solution  for  ihe  field  |2).  Near  the  inlerface  (angles  near 
I  m  * 12  in  Fig.  1).  where  the  “geometrical  optics"  solution  is  rero, 
terms  of  0(lfkfn)  may  be  dominant.  There  terms,  which  a.c  im¬ 
portant  mainly  for  representing  the  field  neat  the  inlerface,  ate 
often  generkalty  referred  lo  as  surface  waves,  although  This  ter- 

Maauicripl  received  Much  J,  l»»J;  revised  October  IT,  IW1  This 
•vi  »«s  wpporied  in  p«n  by  ihe  Joint  Services  Ekvtioaici  Program  vn- 
kcr  Contract!  DAAC24H-C  CHIOS  end  DAAC2«-g|-k  0024. 

Tbe  sethof  n  w«h  the  School  or  i.lcctrica)  Engineering.  Ceorgie  I  naif - 
tele  or  Technology,  Atlenie.  GA  10}  12. 


minology  may  be  somewhat  misleading  as  pointed  out  by  Schel- 
kunoff(3). 

The  early  investigators  of  this  problem  considered  antennas 
(usually  infinitesimal  electric  and  magnetic  dipoles)  in  air  (region 
t)  over  a  planar  earth  (region  2).  with  the  application  being  to 
radio  wave  communication  in  ihe  atmosphere  |4) ,  |S) .  The  field 
in  the  air.  both  the  space  wave  and  the  surface  wave,  was  of  pri¬ 
mary  interest.  The  surface  wave  for  this  care,  which  has  the  as 
ymplotic  behavior  0(1/* Jr1),  has  been  referred  to  by  Walt  as  the 
"Norton  surface  wave"  |6|. 

A  second  case,  which  has  received  considerable  attention,  is 
that  of  an  antenna  in  a  highly  disaipaiiie  half-space  (region  I) 
with  the  adjacent  half-space  (region  2)  usually  being  free  space 
PI- 1*1-  Fnr  example,  the  antenna  may  be  on  a  submarine  boat 
below  ihe  surface  of  Ihe  ocran.  For  this  care,  Ihe  field  near  the 
interface  is  usually  of  interest;  It  is  described  by  ihe  so-called 
"lateral  wave,”  which  has  the  asymptotic  behavior  (X l/k’r1 ). 

The  case  considered  in  this  paper  is  lhai  of  directive  transmis¬ 
sion  from  the  amenna  into  the  adjacent  half-space  (region  2)  by 
means  of  the  space  wave.  This  care  has  received  little  attention  in 
the  literature, even  though  its  mathematical  description  is  straight¬ 
forward  and  often  leu  complicated  than  that  of  Ihe  surface  waves. 
The  reason  foi  this  is  fairly  simple.  The  spree  wave  is  exponentially 
damped  (exp  ( — a7r)|  when  Ihe  medium  (region  2)  is  dissipative; 
this  limits  Ihe  radii  with  useful  field  strengths  lo  impiaclicably 
small  values  for  many  purposes  However,  there  are  applications 
with  anlennas  used  to  transmit  a  signal  into  the  adjacent  half- 
spree  ai  points  dueclly  below  the  antenna  (polai  angles  near  0  « 
0  in  Fig.  I ).  For  example,  anlennas  above  the  sorface  of  ihe  earth 
may  be  used  In  a  communications  link  with  underground  tunnels 
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of  to  truumll  sad  receive  the  sijntJi  used  in  s  tyiism  to  detect 
bulled  objects  |9| .  When  the  diuipetion  in  the  tnedu  Is  not  ex¬ 
cessive .  the  H>sce  wire  cm  provide  s  useful  description  of  the 
sntenns's  direcuve  properties  for  these  sppllcstionL 

The  motivation  foe  this  work  was  a  recent  investi|st.on  by 
the  author  and  An  of  the  directive  properties  of  horizonul  cir¬ 
cular-loop  an-rnnss  over  a  msterlsl  half-space  |I0).  (11).  That 
study  showed  that  a  loop  in  free  space  over  a  material  half¬ 
space  could  have  a  directive  field  pattern  into  the  half-space 
when  (he  loop  i>  dose  to  the  interface  Md  near  resonant  site 
(the  circumference  of  the  loop  approximately  one  wavelen|ih  in 
free  space).  The  directive  properties  of  a  rtsona.it  loop  in  air  over 
fresh  water  are  illustrated  by  ihe  theoretical  and  experimental  re¬ 
sults  presented  in  Fig.  2.  The  electric  field  patterns  in  Fig.  2(a) 
are  for  an  isolated  loop  in  air  (dashed  line)  and  for  the  same  loop 
In  air  over  fresh  water  (solid  line).  The  quantity  r“ 'I  I  is 
graphed;  the  exponential  factor  is  included  to  compensate  for  the 
dissipation  in  the  water.1  The  nearly  circular  pattern  of  the  iso¬ 
lated  luop  is  seen  to  change  to  a  direc'ive  pattern  into  the  water 
when  the  loop  is  placed  over  the  interface.  The  relative  gain  of 
the  loop  for  transmission  into  the  water  6(9  =  0).  as  shown  in 
Fig.  2(b).  has  a  peak  when  the  loop  is  close  to  the  interface. 
hfXo  *0.075.  The  peak  gam  is  about  60  time:  the  pin  of  die  iso¬ 
lated  loop. 

In  this  paper,  the  field  of  a  geneial  antenna  over  a  material 
half-space  it  expressed  as  a  spectrum  of  plane  waves.  The  resulting 
integrals  arc  evaluated  asymptotically  to  obtain  the  "geometrical 
optics"  field,  and  this  is  used  to  define  quantities  that  describe 
the  directive  properties  of  the  antenna.  Numerical  results  for  in- 
finiresrmal  electric  and  magnetic  horizontal  dipole  antennas  are 
provided  as  illustrative  examples.  The  theory  completely  ex¬ 
plains  the  aforementioned  directive  properties  of  Ihe  horizontal 
circular-l-wip  antenna  over  a  material  half-space. 

II  SPECTRAL  REPRESENTATION  FOR  THE 
ELECTROMAGNETIC  FIELD 

The  geometry  to  be  used  foi  the  antenna  over  the  material 
half-space  is  shown  in  Fig.  I.  Two  rectangular  coordinate  systems 
are  shown:  the  unprtmed  system  (x.y.z)  with  origin  O  on  the  in¬ 
terface  and  the  primed  system  (ar \y  .  z’)  with  origin  Of  on  the  an¬ 
tenna.  The  antenna  is  enclosed  by  the  two  planes  at «•  f  ..  -i  _ 
(z  =  -h  +  6..  -h  -  6  _);  these  are  parallel  ro  the  intcrfacial 
plane,  z’  «  h  (z  -  0) 

The  spectral  analysis  is  based  on  i  knowledge  of  Ihe  incident 
electric  field  E,(x',y‘,  z’)  on  the  two  plan.s  enclosing  the  antenna, 
z  *  *«,  The  incident  electric  field  is  the  field  o(  ihe  antenna 
when  it  is  isolated  in  an  infinite  medium  with  the  electrical  prop¬ 
erties  of  region  I.  and  the  current  distribution  in  the  amenni  is 
taken  ro  be  the  same  as  tl  is  when  the  antenna  is  over  the  half¬ 
space.  For  the  analysis  presented  in  (his  section,  the  incident  elec¬ 
tric  field  on  the  planes  enclosing  the  antenna  is  assumed  to  be 
known;  it  will  be  determined  for  a  few  simple  antennas  in  Sec¬ 
tion  IV. 

A.  Isolated  Antenna 

The  electromagnetic  field  of  ihe  isolated  antenna  is  expressed 


I  The  definition!  for  the  field  pill  vn  and  the  pin  uf  an  antenna  over  a 
half-space  arc  dtxcusued  H)  SnIMf  III. 


tad’ 


-  THtOftV  |  LOO*  OVER 

m...  EXFCfltMCNT  | 

- 1MEOXV.  LOO*  IN  AIR 


Ul 


ft) 

Fif .  2.  Duccio*  properties  of  resonant  circular  loop  antenna  *  1 .0) 
tntr  interface.  (i)f  eld  patterns  fur  nutated  loop  in  air  and  for  loop  in 
ait  over  fresh  »ticr.  •  0  075.  •  5.5  X  10“V  The  pattern  for 

the  notated  loop  n  a  fai-ione  pattern:  those  fur  (he  loop*  over  water 
are  for  the  ndwt  f/  a  S  15.  lb)  Retain*  pain  for  loop  in  air  above 
fresh  water  as  a  function  of  the  hcipht  above  the  interface  hi Kq.  The 
iheoretiraJ  results  are  for  lossless  media.  while  the  experimental  results 
are  for  prj  •  $.J  x  10-7. 

as  a  spectrum  of  plane  waxes: * 

•  exp  (-M|  '?)dK.  (2*) 

HHx',y\z  )-~  ff  |«jAj(A)  —  i^Aj(A")J 

•  exp  (-ft }  •  r’)  dK.  (2b) 

where  dK  »  dk,dky.  Here,  the  plus  end  minos  superscripts  refer 

to  the  fields  in  the  regions  z'?  6.  tndz'  <  -S_.  respectively. 


*  The  dctJth  uf  the  spectral  are  omitted  here,  and  unlx  the 

final  remits  are  summartred.  A  dmauma  ol  the  p envoi  procedure  n  in 
several  texts.  1 7 1 .  1 1 2 1 .  and  1 1) | .  The  notation  used  h  ruuphly  the  same 
as  that  in  Ihe  monupraphby  Krrm|l2|. 
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The  components  e (  the  apectril -density  function  X  an  A, 
which  b  parallel  to  th«  plant  of  incidence,  *te,  the  plane  formed 
by  the  propagation  vcctoe  (,  tnd  i'.  end  At  which  it  normtl  to 
the  plane  of  incidence: 

J‘(*)  ~  <j) 

Dm  spactral-dcnslty  function  it  determined  from  the  tangential 
component  of, the  electric  Held  on  the  ptttilel  planet  encloiini 
the  antenna,  £?(*',  /,  /’  «  id,).  From  the  formula  for  the  in¬ 
verse  two-dimensional  Fourier  transform  and  (2). 

ft* 

*ri  LL 


•  dS\ 


««P  t/T|(*)M 
A 


.  • 


•  |1*  X  &*'./.  i’-**,)!  (4b) 

where  dS‘ «  dx’dy  . 

The  plane-wave  propagation  vector  it  j?  i 

i\mXty,(K)}.  (St) 

with  the  transverse  component  1 

K*kjc+kry,  K  -  v4]  +  *J.  (So) 

From  the  relation 

-*i  -*i.  (Sc)  i 

it  follows  that 

T.ffO-ViT7^.  (6a) 

where  the  branch  of  the  square  root  is  chosen  so  that  The 

7i(*)--fVKI -*?  (6b)  Mo» 

t  m 

when*,  it  real  and  A'3  >*}. 

The  unit  sectors  in  the  spectral-density  function  (3)  are 

tl m 

*|*  A  • 

Note  that «,  is.  in  general,  a  complex  *e<-tor,  whereas  e,  is  always 
t  real  sector  when  k,  and  *,  are  real.  Both  eff  and  cL  are  orlhog- 
0"-.l  to  the  propagation  sector  *,.  The  geometrical  relationship 
between  these  sectors  when  *,  and  d,  are  real  is  shown  in  Fig.  3 . 

*  Awtrm*  Ovtr  Httf  Sport  wh 

The  electromagnetic  field  incident  on  the  interface  separating 
the  two  material  region;  is  the  field  of  the  isolated  antenna  (2). 

After  a  change  to  the  unp.imed  coordinate  system  (jr ,y.  r),  the  *n<* 
incident  electric  field  becomes  , 


Fig.  1.  Geometrical  relationship  between  wave  sectorsf  and  unit  vectone. 


£,  in  region  2: 


EAx.y.f) 


•£fl“ 


*«P  Ht,(E)b-lk\‘r\dK. 


(5c)  i  I  ff 

|i;rl(*M;(A)+i'r1(AMl(A)) 

(«»)  ***P|-fTi(*)»->f,*?JdA-.  z  >0.  (10) 

The  reflection  and  Uansmisskm  coetncienls  in  the  abuse  cqua- 
(6b)  'i0*11  ***  obtained  by  imposing  the  boundary  conditions  at 


*•(*>- (*5t.  -*JtiW*?7.  ♦*?»!> 

*.(*')- (T,  “TjV(7i +>t) 

r,(A)-a:,*,7iA*l7, +.*?7i) 

F’rfA’)  ”  2y,(7|  +7,) 

The  propagation  sector  for  the  transmitted  field  is 
(,-*  +  7j(A)S. 


7j(*>  -  VST-  A*  -  VA|-*?-t-Tj(A  ) . 

The  branch  of  the  square  root  in  (14a)  is  chosen  so  that 

ri<X)--rATZ7j 


4-j!d{(^)e,/ttf*,*|exp(-f*}  •?\dK.  (•)  ,*h»"  *»*•  real  and  A»  >*> 

The  unit  sectors  used  in  the  expressions  for  the  reflected  and 


The  incident  fleld  interact,  with  the  half-space.  region  2.  to  give  «.•*.  WendflO).  are 

riae  to  the  reflected  field  *,  in  region  I  and  the  tiananitttd  field  KmK  mK-  A  m»i 
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4- 


-*»!  +  „(<)* 


k,K 


(l») 


la  the  rtgioo  i  <  -i  ■  i.  ihi  lolal  field  £,  is  the  incident 
field  l;  plus  the  teflected  field  £,,  end  in  the  teflon  r  >  0  the 
total  field  fj  Is  the  trenanitted  field  £,: 


ll 


+*,(*><  a*) 


.  e-/t  i  <*  I*  j }  exp  (~/*7  • ')  dK.  z<-A-«_  (16) 


£'j  (*.?.*) 


l«[r,(A'  Httf)  +  i[Tl(K)A\(K)] 


•  «*P  (-/Ti(Ay>  -/*,  •  ?J  dK,  :>0  (17) 

C.  Ctomtltical  Optics  Field 

Equations  (2a).  (16).  and  (17)  ate  integral  representations  for 
the  electric  field  of  the  Isolated  antenna  and  the  antenna  oser  the 
half  space.  For  latge  radial  distances  from  the  antenna  dim  k,r  - 
«■),  th-se  integrals  can  he  evaluated  asymptotically  by  the  saddle- 
point  method  of  integration.  The  technique  is  well  documented 
in  the  literature  and  will  not  be  described  here  |2j.  (7 J .  and 
(I4|.  The  leading  term  in  the  asymptotic  expansion,  which  is 
0(exp  (-/*,r/*,e)|.  is  ilie  “geometrical  optics"  field,  and  this 
term  is  a  valid  representation  for  the  field  provided  the  point  of 
observation  (r,  0. 0)  is  not  near  the  interface  (9  not  close  to  »/2). 

The  “geometrical  optics”  field  w"  he  referred  lo  as  the  far- 
rone  field  and  indicated  hy  the  superscript  r.  e  g..  £'(r,  6.  p). 
The  far-aone  field  for  the  isolated  antenna  is 

-  ,  .  ,  / 

£t(r .9  .Q)m—  - —  Jr,  | cos 6' I' 

2e  r 

•i«vj;(a;,)  +  pu;(a*;,)]  o«) 

with 

K',i  m*i  »in*'|i'cosp’+p'tinp'|.  (19) 

and  the  far-aone  field  for  the  an.enna  over  the  half-space  is 

/  e-'*!' 

mu  ~T~ 

♦  *.(*.i)d;(A>l)f-<*t»iv««i, 

4  9Mr(A’„ )e'* i *"««*•  •  +  #|(A'„)4l(j?fl) 

,,.,tl»i..,atj)  »/2  <  9  <  «  (20) 


2JS 


GVr.t.i) 

l-K*i'  *  >Ai -*j  sin*  9h)) 


•*a  I  cos#  ll»7-,(J:ijV4;(A,1J) 


+  frs(*.aV«;(*ai)l.  0  <  •  <  »/2 

(21) 

with 

jE*  -  *,  sin  9(i  cot  0  +>  tin  9|, 

(22a) 

tta9.  1-1.2. 

(22b) 

The  reflection  and  transmission  coefficients  fll)  and  (12) 
evaluated  at  the  transverse  propagation  numbers  if,,  and  A,j, 
tespectivcly,  arc  simply  the  Fresnel  coefficients: 

_  „  ,  *!i  Icosd  1  — V*j,  -  un*  9 
“  Jj.lcosdl  +  Vjj,-^* 

(23a) 

(23b) 

|cos9|  +  x/Jj,  -sin1  0 

....  .  2*,,xA]a-sir.39 

1  l(A|l)  —  f— - — 

Aj,  I  cot  9 1  Wj},  —  tin39 

(24a) 

2>A|)  —  tin1  9 

UK,  a)- - 'jr- - — 

|cos9 1  +  V*}2  -  tin3  9 

(24b) 

where*,,  ** 1 

III.  DESCRIPTION  OF  DIRECTIVE  PROPERTIES 

For  directive  transmission  into  a  material  half-space,  one  is  pri¬ 
marily  interested  in  antennas  that  concentrate  the  electromagnetic 
field  in  region  2  at  polar  angles  near  9*0*  and  minimise  the 
electromagnetic  field,  or  power  radiated  and  dissipated,  in  region 
I.  The  greater  the  concentration  of  the  field  or  the  gain  is  in  the 
direction  9  —  0*.  the  greater  the  depth  of  penetration  in  the 
half-space  is  before  the  exponential  damping  |exp  f  -  o}  e ))  reduces 
the  field  loan  impracticably  small  value. 

In  this  section,  quantities  will  be  defined  that  describe  the  di¬ 
rective  properties  of  a  general  antenna  ot.-r  a  half-space.  For  the 
most  part,  these  quantities  are  simply  the  familiar  ones  used  to 
dcKrfbc  the  directive  properties  of  antennas  in  free  space,  modi¬ 
fied  to  account  for  the  fact  that  there  arc  two  material  regions 
and  that  these  may  be  dissipative. 

A  Fttld  and  Powtr  Denary  Pttttmt 

The  directive  properties  of  an  antenna  in  infinite  free  space 
are  described  in  terms  of  the  electric  field  or  power  density  pat¬ 
tern.  The  electric  field  pattern  function  £,0(9.  9)  is  obtained 
from  the  fat-zone  or  “geometrical  optics"  field: 

(25) 

where  the  frce-space  wavenumber  k0  is  te.T.  The  far-rone  field 
pattern  is  a  graphical  representation  of  a  vec.o;  component  of 
.  while  the  far-aone  power  density  pattern  is  a  graphical  rep¬ 
resentation  of  the  function 

F,t(9.9)*r*r-  Re|i;(r.» ,♦)!*•  >£«(•.») |a/2(a.  (26) 

Here,  it  the  complex  Poynting’t  vector  in  the  far  zone.  Re  tig- 
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nifiet  the  real  part,  and  the  magnitude  symbol  indies  in  the  Her- 
mitian  magnitude.  I  t..  \A  I  “  (A'A •)* 11 .  Both  A,«  and  F,0  •'* 
independent  of  e. 

For  the  amenna  over  the  half-space,  cither  or  both  of  the  re¬ 
gions  may  be  dissipative,  and  the  definitions  (2$)  and  (26)  must 
be  modified  to  account  for  the  resulting  complex  wavenumber 
and  complex  wan  impedance: 


F,(0,g>)  “  re'tf£\r,e.  •) 

(27) 

/>(«. «  =  rV*'r  •  Re  l£(r, ». «)) 

-R*(r,)|f,(«.»)|,/2ltJ>. 

(28) 

where  /  ■  I .  */2  <  9  <  »:  f  ■  2.  0  <S  <nl 2.  The  exponential 
factors  in  (27)  and  (28)  compensate  for  the  damping  due  to  dis¬ 
sipation  and  make  both  and  Ff  independent  of  r. 

It  is  inipoiiant  to  note  that  the  power  density  patterns  ob¬ 
tained  from  (2d)  may  differ  greatly  from  the  electric  field  pat¬ 
terns  obtained  from  (27)  when  the  electric  niopeities  of  the 
two  regions.  I  and  2.  are  very  different.  Apai>  from  the  squaring 
of  If,1  in  (28).  this  is  a  consequent  of  the  wave  impedance  f, 
appearing  in  ( 28)  and  not  in  (27).  l-or  example,  consider  the  case 
when  both  media  are  lowloss  dielectrics  (o„  «0,i=  1.2)  and 
e,|  =  f o  (free  space).  e,/2  =  rrJ/r0  »  81  (water).  The  real  wave 
impedances  for  the  two  regions  are  f ,  =  f0  and  f  j  =  f0/9.  The 
power  density  function  in  region  I  is  Ff  -  \F,  |1/2fa  and  that  in 
region  2  is  Fp  =  9  I  A,  i3/ 2f0 .  Thus,  in  converting  the  field  pat¬ 
tern  to  a  power  density  pattern,  apart  from  the  squaring  of  I F,  I. 
the  graph  in  region  2  (water)  is  increased  by  a  factor  of  nine  over 
that  in  region  I  (free  space). 


B.  Gain 


The  gain  ol  an  antenna  in  infinite  free  space  is 


- .  4WV-R«(5'c(e.ff.O)j 

Go<y.  <•)  *  - - • 


(29) 


where  P,„  is  the  time-average  power  supplied  to  the  antenna.  Note 
•bar  the  gain  is  independent  of  the  radial  distance  r. 

The  definition  (29)  will  noi  suffice  when  the  antenna  is  over  a 
half-space  and  either  inedi  im  is  dissipative.  In  this  case,  the  nu¬ 
merator  of  (29)  will  contain  the  exponential  term  exp  (-2o,r). 
therefore,  the  gam  will  not  be  independent  of  r.  A  similar  prob¬ 
lem  exists  when  the  gam  of  an  antenna  in  an  infinite  dissipative 
medium  is  considered  |b| .  |  IS) .  The  following  definition,  which 
is  independent  of  r,  is  proposed  for  the  gain  ol  the  antenna  over 
the  half-space: 


C(M) 


4ire>e>°i'e  -  Re  |.T'(r,  t,  p)} 

'.i. 


3»  R« ({,) I />(».»)  1* 


(30) 


where  (  -  I.  tr(2  <  $  <  t:  <  •  2.0<  6  <  W2  The  gam  in  the  di¬ 
rection  4  0  is  of  orimary  interest:  after  inserting  the  spectral 

representation  for  the  field  (21).  it  becomes 


<*»  -  o)  - — *  0)|1 . 
» If  J  l,H4*,Jl,r. 


(31) 


wo  special  cases  aie  of  interest  the  cate  with  region  I  loss- 
less  la,,  -  0).  suen  as  an  amenna  in  free  space  o*e»  the  earth. 


and  the  case  with  regia*  I  and  region  2  both  lossless  (o,,  «  0. 

e„  -  0). 

When  region  I  It  lotArts  and  the  antenna  is  alto  lossless,  the 
time-average  power  input  to  the  antenna  Ptn  is  equal  to  the  total 
time-average  power  pasatg  outward  through  the  parallel  planet 
at  r  »  -  A  +  8,  and  a  »  -A  -  4_ : 

+  +  (32a) 

with 

ti  •  Re  |S^*.>.g)l  < IS.  (32b) 

and  dS  *  dx  dy  After  inserting  the  spectral  representation  fpr 
the  field  into  (32b).  performing  the  spatial  integration,  and  con¬ 
siderable  reduction,  the  two  components  of  P,„  become 


B*  *|f| 

+  m:i*c  -  WU’)!  dk-2  fl  It,  It  Mj  |* 

«>t| 

•  lm(R,e~ :  »•'*)♦  |*  dK  (33a) 


-r^TT  //  T,l Mr l*  +  1*.  i*  MI  i* 

8*  “ill  J[<.1 

+  2  Re  v«»t*)  +  \Al  I*  +  |i*A  |*  Ml  I1 

♦  2R*(R1i4lA;V,<7t*)]d)t,  (33b) 

where  Im  signifies  the  imaginary  part.  When  the  integrals  for  K 
and  P~ ,  (33a)  and  (33b),  are  combmed  and  the  double  integral 
is  converted  to  one  with  respect  to  the  cylindrical  coordinates 
p.  <!>'■ 

P  =  k/k,  *  +*’/*,.  d  -  tan" '  (A  ,/*„),  (34) 

PtK.  for  region  I  lossless,  becomes 


7i(l-4*  |*  +  |A“  |* 


♦  2  r«  KR^^r* 

-2  f  (  It,  II M;  I*  M*i) 

+  M*  |*  Im (A? ,  )|  e"  * ,T  ■  ^pdi dpi.  (35) 


Note  that  ihe  speclial  densities  At  and  A,  in  (35)  are  functions 
of  p  and  d>.  while  y,  and  ihe  reflection  coeflicicnis  AJ,  and  At, 
are  functions  o(  p  only . 

WTieii  both  regions  /  and  2  tre  lossless,  a  directivity  in  tht  di- 
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23? 


rsc boo  •  -  0  oa  bs  defined  for  the  antenna;  It  is  equal  to  the 


(Sin: 


j j‘r  -  X,\S’A>.*.*)\  dQ. 


_  2*i  |A’(A  -  0)1* 

The  power  input  In  (36)  d  given  by  (35).  but  with  the  upper 
limit  on  the  second  integral  no  longer  p  «  •*.  For  k]  >  ft,,  fl. 
snd  Ki  si e  both  rest  for  p  >  ft,, :  this  makes  the  upper  limit  p  - 
in.  For  t,  snd  X,  sie  both  red  forp>  I ;  this  makes 

the  upper  limit  p  “  I  snd  the  tsluc  of  the  second  integral,  there¬ 
fore.  sero. 


IV.  NUMERICAL  RESULTS  AND  DISCUSSION 
The  spectral-density  function  Ad?) mutt  be  known  before  the 
electric  field  or  the  gain  for  a  specific  antenna  can  be  computed. 
Recall  from  (4).  that  the  spectrahdensity  function  is  deteimined 
from  the  incident  electric  field  l',(x\  y\  a')  on  the  two  planes 
(z‘  «  IS,)  enclosing  the  antenna.  The  incident  field  is  the  field 
of  the  isolated  antenna  with  the  current  specified  to  be  the  same 
as  that  in  the  antenna  ovei  the  half-space. 

Antennas  formed  from  thin  wires  lying  in  the  iioriaontal  plane 
z  “  -k.  such  as  horizontal  linear  and  loop  antennas,  can  be  ap¬ 
proximated  by  a  current  sheet  for  computing  the  electromagnetic 
field.  The  surface  current  density  in  the  sheet  is  K,  for  an  electric 
current  sheet  and  ICM  for  a  magnetic  current  sheet.  For  example, 
the  horizontal  arculai-loop  antenna  discussed  earlier  can  be  re¬ 
placed  by  the  electric  surface  current  density  K,  *  /(o')S(p'  - 
S)0'.  where  p"  is  the  radial  distance  on  the  plane  z*  -  0. 

The  components  of  the  spectral-density  function  A,  and  At 
are  easily  computed  from  the  surface  current  densities,  for  the 


titctric  current  sheet 

-Wi-£  [ 

1  A-A,e/*;«fS' 

07a) 

j  K  •(iXK.)etl 'rtf'. 

(37b) 

and  for  tht  mtgnttic  current  sheet 

f  K-(!x  £„)*>*  '  dS‘ 

(J»*) 

r  -  «  «v 

K-Kme>*-’  rtf'. 

(3«b) 

The  exact  current  in  the  antenna  or  an  approximation  to  it 
can  be  used  in  (37)  and  (38).  A  discussion  of  the  analytical 
methods  for  determining  the  current  in  the  antenna  over  the 
half-space  is  beyond  the  scope  of  the  present  work,  detailed 
analyses  for  horizontal  linear  and  circular-loop  antennas  are 
in  references  |8|  and  (10) .  respectively.  Only  the  simplest  cur- 
rent-shect  antennas  wg|  be  considered  here,  infinitesimal  di¬ 
pole  antennas.  The  results  for  these  radiators  art  indicative  of 
the  directive  properties  of  more  complex  radiators  ov>r  the 
half  Space.  The  fat-rone  fields.  (20)  snd  (21).  and  the  dire-.iivities 
(36)  for  the  infinitesimal  dipoles  arc  obtained  m  the  Appendix. 


The  ‘‘geometrical  optics’*  or  far-aone  field,  as  previously  men¬ 
tioned,  Is  a  useful  description  of  the  antenna's  directive  proper¬ 
ties  when  the  media  have  *ow  lost  (p„  <  I .  i  «  1 .  2).  In  this  in¬ 
stance,  the  reflection  and  the  tranuniasion  coefficients  (23)  and 
(24)  are  primarily  determined  by  the  dielectric  properties  of  the 
media  (e„  snd  e(J).  and  the  exponential  damping  (dissipation) 
experienced  by  waves  propagating  in  the  space  between  the 
antcnr.i  and  the  interface  (-A  +  i,  <  z  <  0)  is  negligible 
(exp  (-a.  A)  *  t|.  Thus,  for  media  with  low  loss,  the  expres¬ 
sions  for  the  electric  field  (20)  and  (21).  apart  fiom  the  expo¬ 
nential  factor  leap  (~/k,r) |  -  exp  (-o,r).  ate  approximately  the 
same  as  those  for  lossless  media  (p„  »0.l«  1.2). i.e.,  the  pat¬ 
tern  functions  (27)  and  (28)  ate  approximately  the  tame  m  both 
cases.  Therefore,  a  useful  description  of  the  antenna's  directive 
properties  for  media  with  low  lots  d  obtained  by  considering 
lossless  media.1  and  only  the  case  of  lossless  media  with  *,,  >  1 
wilt  be  discussed  in  detail  here. 

In  Figs.  4  and  5,  fat-zone  electric  field  patterns  for  electric  and 
Magnetic  horizontal  dipoles  ate  shown  for  lossless  media  with  the 
ratios  of  wavenumbets  ft,,  “  2  ((,]/<# i  ”  4)  and  ft,,  «-  894 
(e,]/r,|  »  80).  These  two  values  of  ft,,  roughly  correspond  to 
the  extremes  of  dipoles  in  ait  over  dry  earth  and  dipoles  in  air 
over  fresh  water.  In  each  figure  the  px'lttns  lor  the  electric  di¬ 
pole  are  I  £t  |  m  the  plane  p  »  »/2 . 3»/2  and  I  £t  I  in  *!.e  orthog¬ 
onal  plane  p  •  0.  while  the  patterns  for  the  magnetic  dipole  are 
|£(  |  in  the  plane  p  «  0.  »  and  |£'#|  in  the  orthogonal  plane  p “ 
•12,  3»/2.  Results  are  shown  for  the  dipolrs  at  three  heights 
above  the  interface  A/X,  «  0JS.  0.1  and  0.  Note  that  all  of  the 
patterns  ate  scaled  to  have  a  maximum  electric  field  of  one  unit, 
except  the  ritftt  most  graphs  in  Fig.  S  which  have  been  scak.1 
with  the  factors  indicated  on  the  graphs.  The  unusual  shapes  of 
these  field  patterns  will  be  explained  by  comparing  the  far-zone 
field  of  the  isolated  dipoles.  (48).  with  the  fat-zone  field  of  the 
dipole i  over  the  half-space.  (46).  (47). 

The  far-zone  field  in  region  I  is  simply  the  superposition  of 
the  field  of  the  isolated  dipole  with  the  field  or  the  isolated  di¬ 
pole  after  reflection  from  the  half-space.  For  example,  for  the 
electric  dipole,  the  component  I  A  ',,  I  in  the  plane  p  -  »/2.3»/2 
b 

|£(((r,P)|*|  l-*,(A„)e-,,*'*'w"ll  £i(r.#) I.  (39) 

where  \£\g  I  b  the  field  of  the  isolated  dipole.  The  exponential 
factor  in  (39)  k  the  phase  delay  due  to  the  round  trip  of  the 
waves  fiom  the  antenna  to  the  interface.  In  Fig.  6.  ihe  amplitude 
and  the  phase  of  the  plane-warn  teflection  coefficients  X,  and  X, 
arc  shown  as  functions  of  the  angle  of  incidence  $’  for  the  case 
ftg  i  ■  2.  Since  Xa  b  positive  fur  angles  near  9  «  180*  (angles  of 
incidence  $'  in  Fig.  6  that  are  less  than  the  Brewster  angle  fig  • 
tan” 1  (ft|  | )),  the  amplitude  of  the  beck  lobe  of  the  pattern  (the 
lobe  in  region  I)  wdi  be  minimum  when  A/X|  “  0  and  oscillate 
with  increaiing  A/X, ,  having  the  first  maximum  at  A/X,  w  0.2$. 
This  behavior  b  clearly  illustrated  in  the  field  patterns,  and  it  is 
most  pronounced  when  the  ratio  In  b  large,  at  in  Fig.  5. 

For  the  magnetic  dipole,  consider  the  field  component  I  ,  I 
in  the  plane  p  »  0. »: 

lt;e('.«>l-«»  ♦*«<*.  I)*‘1*,*,t“,'ll#e('.»>l  H0> 


i  Tint  point  k  ifluttiitftf  by  tptctfi:  multi  lot  Ihe  arcvlar4oop  ta¬ 
rtan 4  in  f  I0|  «n4|JtJ. 
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Thii  is  (he  tame  expression  a<  (39)  for  (he  electric  dipole  except 
for  a  change  in  tign.  ~Rt  it  replaced  by  +/?,.  The  amplitude  of 
the  back  lobe  in  the  pattern  cf  (he  magnetic  dipole  it  teen  to  oscil¬ 
late  with  increasing  A/A,,  but.  unlike  the  back  lobe  for  the  elec¬ 
tric  dipole,  it  begins  w  ith  a  maximum  at  A/A,  *  0  and  h:*  'he  first 
minimum  at  A/A,  »  0.25. 

The  oscillation  in  the  amplitude  of  the  back  lobe  with  increi  • 
ini  A/A,,  observed  for  the  dipole  antennas,  ^characteristic  of  all 
current  sheet  antennas*  The  back  lobe  for  A/A,  »  o  is  a  mini¬ 
mum  for  electric  current  sheet  antennas  and  a  maximum  for  mag¬ 
netic  current  sheet  antennas.  This  behavior  is  simply  a  result  of 
the  symmetries  of  the  electromagnetic  fields,  or  spectral-density 
functions,  produced  by  the  two  kindtof  current  sheets. 

The  far-zone  field  in  region  2.  the  transmitted  field,  is  the  field 
of  the  isolated  dipole  after  refraction  in  the  half  space.  Consider, 
at  an  illustration,  the  field  component  \k I  in  the  plane  0  ■ 
»/2,  3»/2  for  the  electric  dif  ole,  or  the  same  component  in  the 
plane  d  •*  0, »  for  the  magnetic  dipole  . 


U$.(*al)l 


■  P^r,(A. 
Pma'.i) 


•  exp  (-|lm|y,(A',,)|  |A}  |/.,’((A'1])|. 


A  spectral  component  of  the  transmitted  field  t  \  with  the  propa¬ 
gation  vector  i ,  m  K, ,  ♦  y,(A'll)z  is  seen  to  arise  from  a  spectral 

4  Here,  the  current  distribution  Ur  the  antenna  n  aiurmed  not  to  change 
with  the  herghl  A/X , . 


component  of  the  isolated  antenna  with  the  propagation  vector 
*i  *  *j2  4  7|(A'i2)*- The  longitudinal  component  7i  and  7j  of 
the  two  propagation  vectors  are  related  by  Snell's  law: 

3K*.z>+*i-*>  (42) 

When  d.  the  angle  at  which  the  field  in  region  2  is  evaluated,  and 
the  angle  at  which  the  field  of  the  isolated  antenna  is  evaluated, 
art  introduced.  (421  takes  Uie  familiar  form 

,m  I  M"'1  <*»■  “n®>-  i<‘9r  (4J) 

j  */2 +/ cosh" 1  (*j,  sin d),  d>d<- 

where  dr  »  sin'1  (*,a)  is  the  critical  angle  for  ptopagation  from 
region  2  to  region  I 

Fig.  7  is  a  graphical  representation  of  the  relationship  between 
the  angles  t'  and  d  (43),  oi  the  longitudinal  components  >,(A,j) 
anJ  Tj(A,j)  of  the  propagation  veciors(42).  for  the  case  of  loss¬ 
less  media  with  A,,  «  2  the  fai-/onc  transmitted  field  at  thean- 
glet  0  <  d  <  d,  is  seen  to  arise  from  the  refraction  of  the  spectral 
components  for  the  isolated  antenna  that  represent  propagating 
waves  in  the  direction  z.  I  e.,  wares  wilh  ?,(A',jVA|  s  rest  num¬ 
ber  or  0  <  d'  <  »/ 2  While  the  far-zone  transmitted  field  at  the 
angles  d,  <  d  <  »/2  is  seen  to  arise  from  the  refraction  of  spectral 
components  of  the  isolated  antenna  that  represent  evanescent 
waves  in  the  ditcrtion  z.  I  e  ,  7,(A,j)'*i  a  pure  imaginary  num¬ 
ber  oe  d’  a  complex  angle  Thus,  the  propagating  spectrum  of  the 
isolated  antenna  produces  the  far-zone  transmitted  field  within 
the  cone  of  angles  0  <  d  <  dc,  while  the  remainder  of  the  far- 
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F*a  5  Electric  OcM  patterns  fnr  dipoles  at  various  heights  AA,  above  interface  between  loitlnt  dielectric  media.  Ijj*  S.94 
(fr]/r,i  ■  SO  0).  Note  change  in  Kale  for  L'e of  mainetic  dipole. 


tone  transmitted  field.  9y.  <  0  <  tr/2.  is  due  to  the  evanescent 
spectrum  of  the  isolated  antenna 

The  relationship  (41)  between  the  far-tunc  transmitted  field 
and  the  fat-rone  field  of  the  isolated  antenna  contains  the  trans¬ 
mission  coefficient  T,  in  the  factor  (■>, hl\)T;.  The  ratio  7,/y, 
accounts  for  the  change  in  the  density  of  tile  spectral  components 
on  refraction.  In  Fig.  6,  the  amplitudes  of  the  transmission  coef¬ 
ficients  T,.  r4  and  the  factors (7,/y,  )F,. <y,/7,  )T,  ate  shown  as 
functions  of  the  angle  of  transminon  A  for  the  case  k} ,  =  2. 
Note,  when  9  >  A,  the  transmission  coefficients  are  for  evanescent 
waves  incident  from  region  I.  The  exponential  factor  in  (41)  af¬ 
fects  only  the  ponton  of  the  transmitted  field  (0..  <  t  <  »/2)  that 
antes  from  the  evanescent  spectrum  of  the  isolated  antenna. 

The  shapes  of  the  far-aone  patterns  fot  the  transmitted  field, 
the  field  in  region  2  shown  in  Figs  4  and  $.  are  now  easily  ex¬ 
plained.  First,  consider  the  sequence  of  patterns  fot  the  hori- 
rontal  electric  dipoles,  the  patterns  on  the  left  of  tnese  figures. 
When  the  dipole  is  at  the  interface.  A/A,  =  0,  the  transmitted 
patterns  are  bread  with  significant  amplitude  for  6  less  than  and 
greate’  tnan  the  critical  angle  9C.  As  the  dipole  is  raised,  A/A,  in¬ 
creased.  the  amplitudes  3!  the  transmitted  patterns  at  angles  9  > 
9,  decrease,  becoming  negligible  when  A/A,  =0  31.  This  effect  is 
the  result  ol  the  exponential  factor  in  (41 1.  The  transmitted  field 
for  9  '  •  9{  is  due  10  the  evanescent  spectrum  of  the  isolated  an¬ 
tenna;  it  suffers  exponential  damping  which  incieascs  as  the  an¬ 
tenna  is  raised  above  the  interface.  Thus,  for  large  h/X,  (he  trans¬ 
mitted  pattern  is  significant  only  within  the  cone  of  angles  0  < 


8  <9,  For  ihe  two  cases  in  Figs.  4  and  $.  the  critical  angles  are 
*t  =  30 ---2)  and  A,  =  6 42"  fit,,  =*  94), 

The  sequence  cf  patterns  for  the  horironu.  magnetic  dipoles, 
the  patterns  on  the  right  of  Figs.  4  and  S.  show  less  variation  with 
the  height  of  the  dipole.  h/A, .  than  do  the  patterns  for  the  elec¬ 
tric  dipoles.  However,  the  patterns  for  the  component  of  the  field 
I  t\t  |  have  an  interesting  cusp  at  angles  neat  the  critical  angle  Ar; 
it  ts  most  pio.sounced  when  the  ratio  It, ,  it  large,  as  in  Fig.  5. 
The  cusp  is  due  to  the  factor  ir  the  expression  for  this 

component  of  the  field  (41).  it  it  clearly  shown  in  Ihe  graph  of 
(y,/Ti  »r,  in  Fig.  6.  The  physical  explanation  for  the  cusp  is  th.l 
on  refraction  the  spectral  components  of  the  isolated  antenna  are 
redistributed,  theit  density  being  increased  at  angles  near  the 
critical  angle.  The  cusp  Joes  not  occur  in  Ihe  patterns  for  the 
electric  ^dipole,  because  the  electric  field  of  the  isolated  eiectric 
dipole  E ^  (48a)  has  a  null  at  A'  »  »/2  (A  «  0t)  which  cancels  the 
cusp  in  (7,(7, )7y 

The  directivities  (49a)  and  (49b)  of  the  electric  and  magnetic 
horiror.nl  dipoles  for  lossless  media  are  shown  as  e  function  of 
the  height.  A/A , .  by  the  solid  lines  in  Fig.  8.  the  parameter  e-.,  n 
the  ratio  of  permittivities,  e,,  «  e,,/c, ,  The  directivity  tf  the 
electric  dipole  is  seen  to  be  maximum  when  the  dipole  is  close  10 
the  interface,  A/X,  a  0.1,  while  the  directivity  for  Ihe  magnetic 
dipole  is  maximum  when  A/X,  •  -  0.3$.  The  neak  directivities  for 
both  dipoles  increase  with  increasing  e,,.  and  for  e,,  ►  I  ihev 
aie  substantially  higher  than  Ihe  diteclivirie*  of  the  isolated  di¬ 
poles.  D,e  “  Dmt  r-  3/2. 


240 


tf.ff.  T*  A  NS  ACTIONS  ON  ANTENNAS  AND  NlOrAGATlON.  VOL.  AMJ.  NO.  ).  MARCH  |*M 


i* 


0  »  I 

r  tofGAiESi 


90  «0 

r  «KG*CE*> 


I it  I  n! 

loll,  ||  10 


0  JO  00  00  0  90  10  to 

«,  «  (DEGREES!  0,  »  lOtOMfSI 

ftf  4.  fUnc  «m  telVetion  coefficient*  at  a  function  of  the  an*Jt  of  incidence  #'  and  plane  »i«t  traninmwon  coefficient* 
as  a  function  of  the  antic  of  tranvrmuon#.  fo#  lotsle is  media  with  ft2|  •  2  0(e,2/r,j  •  4  0). 


EVANESCENCE! propagation 


UmiiyKj  I  7*m1 

-2  0  -10  0  J  A  .7  J  10 

_i - 1 - 1 _ i _ i _ i _ i _ i 

tji  i.o  oo*  vf  r  yt  f 


mf  t  vf  i,  * 


0  J  A  .7  J  1.0 

■  ’1*1 


EVANESCENCE  I  MOOAGATION 


ISOLATED  ANTENNA 


VF 


ANTENNA  OVEN 
HALF  SPACE 

7Er,. 

H  * 


Ft®.  7.  Relationship  between  pfopjplton  vector*  of  notated  antenna  and  of  antenna  over  half-space,  f«  kmleu  media  a 

*21  *2  0^,/#,,  -4.0>. 


7-24 


SMI?"  TRANSMISSION  INTO  MATERIAL  MALT  S? ACE 


241 


ELECIfttCOlPOLf  MAGNETIC  OWOLC 


Tif  8.  Directivities  (gauir  for  lorry  medu)  of  eteclnc  and  itugnenr  horuonul  dipoles  it  a  function  ot  thehetfht  hi  A,  above 
Ihc  interface  between  dielectric  media,  with  (he  ratio  of  permittivitiet  1)1  ■  lr]/i,|  at  a  parameter.  Solid  line:  lottleri  medu 
Pet  *  Pel  *  n-  darhed  line:  region  2  !ou>  Cel  *  0.  Cej  ■  10. 


The  dependences  of  ihe  direclivities  on  ilie  height  of  the  di-  finiiesimal  dipole  in  contact  with  a  dissipative  medium  must  have 
poles.  A/it, .  are  easily  explained  with  ihe  help  of  ihe  prior  dis-  infinite  power  input  lo  maintain  a  finue  field  ai  a  distance  |  lb| . 

cussion  of  the  field  patterns.  Consider  the  power  radiated  by  the  |I7| .  Since  .V  is  infinite,  the  gain  (311  is  zero 

dipoles  (or  their  field  patiernsi  lo  be  composed  of  three  compo¬ 
nents:  Ihc  power  in  the  main  beam  (the  pattern  in  region  2  within  V.  COMPARISON  WITH  EXPERIMENT 

the  cone  of  angles  0  <  8  <  8t),  the  power  in  the  sidelobes  f  Ihe 

pattern  in  region  2  at  angles  8r<8<  n/2  that  is  Ihe  result  of  the  A  limited  experimental  program  was  performed  to  confirm  the 
evanescent  spectrum  of  the  isolated  dipole),  and  the  power  in  the  theorelical  results  for  the  electric  and  magnetic  horizontal  dipole 

back  lobe  (the  pattern  ir.  region  I )  The  directivity  is  maximized  antennas  over  a  half-space.  The  experimental  apparatus  is  shown 

by  minimizing  the  power  in  the  side  and  back  lobes.  in  Fig.  9.  A  plastic  lank  containing  fiesh  water  has  a  vertical 

When  the  electric  dipole  isat  the  inteiface.  A/A,  *=  0.  the  power  metallic  image  plane  attached  at  one  side  Monopole  and  half¬ 
in  Ihe  sidelobes  is  large  and  the  directivity  is  low.  see  for  example  loop  test  antennas  are  mounted  on  Ihe  image  plane  and  are  fed 

h/X,  wOinFig.  5  The  power  m  the  sidelobes  decreases  as  the  dt-  from  behind  the  plane.  A  small  monopole  probe  protrudes 

pole  is  raised  above  the  interface  and  the  directivity  increases.  through  the  image  plane  and  is  free  to  .wove  through  300°  on  a 

The  power  in  the  back  lobe,  however,  increases  as  the  dipole  circle  of  radius  r  =  30  cm.  This  probe  is  used  to  measure  the  field 

is  raised  and  eventually  causes  the  directivity  to  decrease,  see  component  A'o(0>  in  the  air  and  in  Ihe  water.  A  second  probe. a 

for  example.  A/A,  =  0  35  in  Fig  S.  Thus,  these  two  competing  small  dipole,  is  mounted  on  a  moveable  arm  and  h  free  to  move 

effects,  the  decrease  in  the  power  in  the  sidelobes  and  Ihe  in-  through  90“  on  a  circle  of  radius^  «=  30  cm.  This  probe  is  used  to 

crease  in  power  in  the  back  lobe,  give  rise  to  the  peak  in  the  di-  measure  the  field  components  E,(A|  in  the  water.  At  the  mcas- 

rectiviiy  of  the  electric  dipole  for  A/A,  «  0.1.  The  increase  in  uremcnl  frequency  of  900  MHz  ana  loom  temperature,  the  elec- 

the  peak  directivity  with  increasing  <2 ,  is  Ihe  result  of  a  decrease  trrcal  properties  of  the  rresh  water  are  approximately  frrl  *  78  b. 

in  the  width  of  the  main  lobe  (A,  decreases  with  increasing  ej,!.  Pei  *  5  3  X  10' !  In  the  atr  0,z  =  flo?*  4.65,  and  in  Ihe  water 
Note  that  the  patterns  in  Figs.  4  and  5  ate  for  heights  that  roughly  flr?  *  50.6.  o2r  *  1 .34. 

correspond  to  the  maxima  and  minima  in  the  directivity  of  the  The  infinitesimal  horizontal  electric  dipole  was  approximated 
electric  dipole.  by  an  electrically  small  insulated  monopolc  antenna  with  height 

For  the  magnetic  dipole,  the  power  in  both  Ihe  sidelobes  and  i.  *  3  X  I0‘ 1  V  and  the  infinitesimal  horizontal  magnetic  dt 

•he  back  lobe  decrease  as  the  height  of  the  dipole  is  increased  M«  »»  approximated  by  an  electrically  small  insulated  half-loop 

from  AAt  *  0  The  back  lobe  is  minimum  when  A /A,  *  0.75.  «ntenna  with  radius  b  »9  X  10  '’  A*.  Note  that  the  plane  of  the 

*"d  the  maximum  directivity  occurs  close  to  this  point.  A /A,  w  loop  was  vertical  to  make  Ihe  axis  of  the  equivalent  magnetic  di- 

u.jy.  pole  parallel  lo  the  air-water  interface.  Measured  field  patterns  for 

The  gains  of  the  electric  and  magnetic  horizontal  dipoles  for  *«  *ntennas  arc  presented  in  Figs.  10  and  It  Results  are 

region  2  lossy.  prJ  =  I  0.  are  shown  by  the  dashed  lines  in  Fig.  ih',wn  f,,r  heights  of  the  antennas  above  the  air-water  tnter- 

8.  These  results  were  computed  (•;  substituting f45)  and  (35)  into  •*•’«.*/*•  =  OandO  I.  T be  prohes  used  to  measure  Ihe  field  were 

(31)  and  performing  the  imegratron  with  respect  top  numerically,  oncalibrated.  therefore,  the  normalization  of  the  patterns  is  ar- 

The  addition  of  loss  to  region  2  is  seen  not  to  affect  substantially  hilrary.  The  maxima  of  Ihe  patterns  m  regions  I  and  2  wc.e  set 

the  overall  shape  of  Ihe  directivity/gain  curves.  A  noted  difference  «9U»*  >°  <h«se  for  the  correspoi  ling  theoretical  resuhs  for  di- 
occurs  when  the  dipoles  are  on  the  interface  A/X|  =0;  all  of  the  poles  over  a  dielectric  with  e,,  *=  8.'.  Fig  5.  The  field  U#|(mcas- 
gain  curves  for  Ihe  lossy  media  are  aero;  while  the  directivity  tired  with  the  dipole  probe)  at  Jhe  f  rsl  measurement  point  6  *  4“ 

curves  for  the  lossless  media  have  nonzero  values.  The  aero  gain  »»*  *9ual  ,u  the  ft**d  in* I  (measured  with  the  monopole 

obtained  when  the  dipoles  are  in  contact  with  the  lossy  medium  probe)  at  the  angle  8  «  0  . 

is  a  consequence  of  a  phenomenon  described  by  Tai.  viz.,  an  in-  Th*  qualitative  agreement  between  the  theoretical  and  the 
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Fit  II. 


Measured  field  palleint  fur  mjfncnc  dipole  (electrically  small  loop)  in  III  at  two  heights  h/A^  above  interface  between 
air  (l)and  iresh  water  (2).  tgr  *  5  *S 


measured  patterns.  Fig.  5  and  Figs.  10  and  II.  is  good.  The  de¬ 
crease  in  the  side  lobes  of  ihe  transmitted  field  for  the  electric 
dipole  with  increasing  height.  hf\0 .  is  clearly  shown  in  Fig  10. 
A  noted  discrepancy  between  the  theoretical  and  the  measured 
patterns  is  the  absence  in  the  measurements  of  the  sharp  detail, 
such  as  the  cusps  at  angles  near  9C  in  the  theoretical  patterns  for 
the  magnetic  dipole.  This  is  attributed  to  the  measured  patterns 
not  being  far-aone  patterns,  the  radius  of  the  measurement  circle 
is  only  0.9  wavelengths  in  the  air  and  8  1  wavelengths  in  the 
water.  The  Finite  sire  of  the  field  probes  also  smooths  Ihe  meas¬ 
ured  patterns.  The  monopole  and  dipole  probes  subtend  angles 
on  the  measurement  circle  of  19°  and  3.8°,  respectively,  these 
angles  are  to  be  compared  to  the  critical  angle  9C  =  6.4°  which 
determines  the  width  of  the  main  beam  in  the  water. 

While  the  two  factors  described  above  are  thought  to  be  the 
major  causes  of  tit*  discrepancies  between  the  theoretical  and  the 
experimental  results,  there  are  other  factors  that  affect  the  com¬ 
parison.  The  model  antennas  are  only  approximations  to  the  in¬ 
finitesimal  dipoles.  This  difference  is  particularly  important  when 
the  antennas  are  near  the  interface;  for  example,  when  h/30  «  0. 
the  small  loop  it  half  in  the  air  and  half  in  the  water.  T  here  is  at¬ 
tenuation  in  Ihe  water  |exp  f-o,r)  w  0.76),  whereas  the  theo¬ 
retical  results.  Fig.  5.  are  for  lossless  media.  There  are  reflections 
fiom  the  walls  of  the  tank  and  edges  of  the  image  plane,  although 
these  are  reduced  by  the  attenuation  under  the  water.  The  re¬ 


sponse  of  the  dipole  probe  on  the  moveable  arm.  see  Fig  9,  is  af¬ 
fected  by  its  image  in  the  metal  plane  at  small  angles  9.  this  pro¬ 
duces  the  ripple  in  Ihe  measured  patterns  I  /?,  I  for  angles  near 
9  "  0°.  A  similar  effect  occurs  when  the  monopole  or  dipole  probe 
is  near  the  air-water  interface. 

The  relative  gain  (31)  was  measured  for  a  quarter-wave  mono¬ 
pole  antenna  (i/X«  w  0.15)  in  air  above  the  surface  of  the  water, 
and  the  results  are  plotted  as  a  function  of  the  distance  above  the 
interface,  hfa,  in  Fig.  12.  The  general  behavior  of  the  measured 
gain  for  the  quarter-wave  monopole  is  in  good  agreement  with  the 
theoretical  directivity  (gain)  for  the  infinitesimal  electric  dipole. 
Fig.  8.  The  gains  of  electrically  small  monopole  and  half-loop  an¬ 
tennas  were  not  measured,  because  the  losses  in  the  transmission 
line  connections  to  the  small  antennas  were  significant  and  could 
not  be  determined  accurately  enough  for  subtraction  from  the 
measurements. 

VI  summary  and  conclusion 

A  procedure  was  developed  to  analyte  antennas  for  directive 
transmission  into  a  material  half-space.  This  is  based  on  the  plane 
wave  spectra  of  the  fields  for  the  isolated  antent.a  and  for  Ihe  an¬ 
tenna  over  the  half-space.  The  "geometrical  optics"  field  was 
used  to  define  pattern  functions,  a  gain  and  a  directivity  that  de¬ 
scribe  the  directive  properties  of  the  antenna  over  the  half-space. 

The  directive  properties  of  infinitesimal  electric  and  magnetic 
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hr  12  Mcjuired  relative  Cain  fill  *  0)  of  quarter-wave  monopole 
f/./Aq  *  0  251  asa  luncuam  of  height  h/x„  above  the  interlace  between 
air  am)  frccfi  main. 


horizontal  dipole  antennas  mere  examined  for  the  case  of  lossless 
media  The  results  for  flic  dipoles  are  indicative  of  those  for  other 
antennas  that  can  be  modeled  by  electric  and  magnetic  horizontal 
current  sheets.  The  theoretical  predictions  for  the  elementary  di¬ 
poles  aie  in  good  agreement  with  the  experimental  results  for  an¬ 
tennas  near  an  air-mater  interface,  even  though  there  are  many 
differences  between  the  theoretical  and  experimental  models. 

An  interesting  outcome  of  the  investigations  is  that  the  hori¬ 
zontal  electric  dipole  can  produce  a  beam  into  the  half-space 
below  the  antenna  when  the  ratio  of  wavenumbers  I  * 2 ,  I  is  greater 
than  one.  such  as  when  the  dipole  is  in  the  air  above  the  earth. 
The  maximum  directivity  occurs  when  the  dipole  is  close  to  the 
interface,  h /X,  *  0.1,  and  for  lossless  media  the  half-width  of  the 
beam  is  approximately  the  critical  angle  fl,  =  sin"1  (<;,  j). 

The  previously  mentioned  directive  properties  of  the  resonant 
circular-loop  antenna  over  a  hall-space.  Fig.  2.  arc  now  easily  ex¬ 
plained  Recall  that  the  tcvmjni  loop  antenna  (fi0h  ~  1 .0)  has  a 
current  disl’ibulion  proportional  to  cos P,  and  that  tills  is  approx¬ 
imately  equivalent  to  the  current  in  a  pair  of  parallel  electric  di¬ 
poles.  The  equivalence  for  the  currents  is  shown  schematically  in 
Figure  IJ(a).  In  Fig.  13(b).  the  fat-zone  field  pattern  is  plotted 
for  the  pair  of  infinitesimal  electric  dipoles  that  are  equivalent  to 
the  loop  antenna  described  in  Fig.  2  (loop  in  air  above  wafer, 
2*  *  Xj/v,  .i/Xo  »  0075).  Comparisons  of  the  field  patterns  in 
Figs.  2(a)  and  13(b)  and  the  gams  m  Figs.  2(b)  and  I)  show  that 
the  directive  properties  of  the  resonant  loop  ate  correctly  de¬ 
scribed  by  the  theory  for  the  electric  dipoles.5 

The  theoretical  and  measured  gams  (directivities)  for  the  in¬ 
finitesimal  dipole,  the  quarter-wave  monopole  and  the  resonant 
loop.  Figs.  8.  12.  and  2(h).  all  have  peaks  (or  h/X,  in  the  range 
0  07  <  h/X,  %  0  12.  It  is  interesting  that  the  input  resistances  of 
quarter  wave  dipole  and  resonant  circular-loop  antennas  over  the 
earth  obtain  reasonable  values  over  the  same  range  of  h/X,  This 
h  illustrated  in  Fig  14  where  the  input  resistance  is  pinned  as  a 
function  of  hfa  for  a  resonant  dipo'c  over  wet  ground  (experi¬ 
mental)  1 1 K) .  fur  a  dipole  of  half-length  fol.  =  1.51  over  earth 

*  Note  that  the  pattern  m  Fig  1 3(h)  is  for  the  fair  .me  and  totsksi 
media,  while  the  pattern  in  fig.  2ia)  it  fur  a  fmiie  radnit  and  low-loo 
media . 


Fig.  t }.  (a)  Schematic  diagram  of  current  m  resonant  cwcwlat  loop  tdfch  • 
1 .0)  and  in  appioxnnaleft  equivalent  paw  of  electric  dipoles  (h)  f  wM 
pattern  for  pair  of  infinitesimal  rkcuic  dipoles  m  aw  above  warn.  2*  • 
Aq/w.h/Xq  ■  0.07$,  prg  •  0. 
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1  if.  14.  input  rcwtunce  of  muni*  NttiAii  4%  I  function  of  the  ht*ni 
A/A0  above  the  interface  between  im  (II  end  eirfh  <21  A  fipera- 
mcnul  re  to  runt  dipole  over  wet  pound  |lt|  B  TNoretKtl  dpolt. 
#0 1  *  ■  U.Prl  •  0  20  1 1 9 1 .  C:  Theoretic*! -circular  loop, 

lift  •  1 .0.  •  I0.|»rj«0  0|  |I0| 
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with  *„i  -  |5.p,j  -  0.20  (theoretical)  (I9|.and  for  ■  circular 
loop  of  radius  Sfi  -  1 .0  over  tsnh  wiih  -  10.  p,j  »  0.01 
(theoretical)  (10) .  The  input  reaiauncea  Sot  the  dipoles  and  loop 
art  approximately  50  Tl  and  100  tl,  teapcctively.  whtn  A/X*  it 
ncai  0.1.  These  icaulU  suggest  tint  single  resonant  dipole  and 
loop  antennas  or  simple  array*  ol'  these  element*  may  he  useful 
for  directive  transmission  into  (he  earth. 

The  analysis  presented  in  this  paper  applies  to  both  the  case 
|  A, ,  |  >  I  and  the  case  I  *j  1 1  <  1 ;  however,  due  to  limitations  on 
the  length  of  the  manuscript,  numerical  and  experimental  result* 
were  only  presented  for  the  case  |A2|  I  >  I.  Results  for  the  case 
I  A)  1 1  <  I  were  also  obtained,  and  these  will  be  presented  in  a 
later  publication. 

During  the  preparation  of  this  manuscript,  a  paper  appeared 
by  Enghcta  el  al.  [20|  that  determines  the  radiation  patterns  for 
infinitesimal  horizontal  and  vertical  electric  dipoles  on  a  planar 
interface  between  two  dielectric  regions  (only  the  case  A/A,  =  0). 
Their  patterns  for  the  horizontal  dipole  agree  with  the  patterns 
for  the  horizontal  electric  dipole  presented  here.  A/X,  =  0  in  Fig. 
4.  Their  patterns  for  the  vertical  electric  dipole  have  cusps  at  an¬ 
gles  near  9  »  9,  as  do  the  patterns  presented  here  for  the  hori¬ 
zontal  magnetic  dipole.  A /A,  —  0  in  Fig.  4.  This  is  not  surprising, 
since  the  horizontal  magnetic  dipole  can  be  modeled  by  a  small 
vertical  square  loop.  The  loop  is  equivalent  to  a  pair  of  couplets, 
one  vertkcl  and  one  horizontal,  each  formed  from  oppositely 
directed  electric  dipoles.  The  vertical  electric  dipoles  can  be  con¬ 
sidered  the  source  of  the  cusps  in  the  pattern  for  the  horizontal 
magnetic  dipole. 

APPENDIX 
DIPOLF.  ANTENNAS 

Consider  the  infinitesimal  electric  and  magnetic  horizontal  di¬ 
poles  p  ■  py  and  m  «  my  aligned  with  the  y  axis.  For  an  elec¬ 
trically  short  linear  element  of  current  /  and  len-'h  SI  the  elec¬ 
tric  moment  it  approximately  p  «  -/7A//u.andi«;an  electrically 
small  loop  of  current  I  and  area  SA  the  magnetic  moment  is 
approximately  m  «  ISA  The  equivalent  surface  current  densities 
for  these  elementary  sources  are 

K,  « l&ll(r\i OlP  =/cjpi(x)6(»r  (44a) 

and 

Km  -  /cju0/A4&(x)«0'ly  "  /cJUomS(x)*0')>  ■  (44b) 


where  4(x)  it  the  Dirac  delta  function.  After  inserting  (44a) 
into  (37)  and  (44b)  into  (38).  the  spectral  density  functions  be¬ 
come:  for  the  electric  dipole 


and  for  the  magnetic  dipole 


Mi*t*.P 
2Ky,((C)  ' 


(45a) 


*}(*)» 


AuUqA, Ayffl 

2A'Ti(A) 


•  *!<*)« 


/cuppAym 

2A- 


(45b) 


The  far-zone  electric  fields  of  the  dipoles  are  obtained  by  suu- 
atituting  (45)  into  (70)  and  (21)  and  evaluating  the  resulting  in¬ 
tegrals.  For  the  electric  dipole 


*  ll+/f1(A.„)e-/,‘«*u“*,le-l*i'/4irr  (46b) 

£j*  -  u>Wj  |P  tin  PI  coed  |  exp  (-jA.AV’l  -*j|j  ainJ8) 

*  r,(K,t)e-i'yn„  (46c) 

“  cu’uoAs iP  toad! co*8|exp(-/A,A>/l-A5|  un2  0) 

*  ri(A. i )T ‘ M  2 ' /4 erVl  —Aj,  sm:  6  .  (46d) 


and  for  the  magnetic  dipole 

E\t « uiuomA,  cotde,*,*'c,>,*, 

•)l  +£,(A;„)e-2,*i‘“«»*|«-/*»'/4»r  (47a) 

£(o  =  aindlcosd|eJ*rkUo‘*1 

•  [»  -*J(K„)e-,,***lt"*'|r~/*,'/4»r  (47b) 

Eii  =  uipQmk  i  cos  p  I  cos  9 1  exp  (-/A ,  Wl— A2|  sir1  0 ) 

•  7-,(AIj)«'/*,'/4*r>/i  -*n  sin1  0  (47c) 

Ell  =  -«Jti0mA}  tin d  | cos 9 1 expf-rXjAv/l -*ji  sin1  9) 

•  n(A,j)e-<*1'/4»z.  (47d) 

The  far-zone  fields  of  the  Isolated  electric  and  magnetic  di¬ 
poles  are  also  needed  for  later  comparisons:  for  the  electric  di¬ 
pole 

E',{e,  0‘.  d’)  “  uW  [cos  0'  sin  fb'  +  cos  d'd‘J«''*  *'  /4*r', 

(48a) 

and  for  the  magnetic  dipole 


E!('.0',0‘')’kup o»>*i  [cosd'*‘-sind'cos9'd')e*,*', /4xr\ 

(48b) 


The  directivities  for  the  electric  dipole  and  the  magnetic  dipole, 
0,(8  =  0)  and  Om{0  =  0).  respectively,  (regions  I  and  2  lossless, 
A,  and  Aj  teal)  are  obtained  from  (36)  with  (35)  and  (4S).  After 
considerable  reduction,  the  directmties  can  be  cast  in  the  foliow- 
ing  simple  forms,  where  the  cases  A,  >A,  and  Aj  <A,  are  treated 
sepa,.u.ly.  For  A2  >  A, 


i  f'it 

•co*(27,A)p<#p--  /  ly,l<-,,T''* 

*1  M 


•  £lm(R,)  +  |^j  lm(/?j)Jpdp  |  (49a) 

‘totG-tth^dp--  f  ^ 

*  I  ^1 

•  j^|  )ro(/f,)jp</pj  ,  (49b) 


£'n  m  -Tj,PoP*fndlcos9|e<*|b|,0,*l 

•l«-*l(A.i)4-,,*'*'“',*')«  /*i'/4xr 


(46a)  andforAj<A| 
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In  drew  expressions  Ti .  R,  and  Ri  are  funclions  of  p.  Ihey  are  2 1^225*  mT'i w> ,W *  •  "H  Ar  "• w 

given  by  (6)  and  (11)  with  K/k,  =  p.  Note.  *  hen  the  electrical  c'V  tV.^'r.I^.L  <*  .  Henr.w  d.pol,  ««me«d  ..  .  <t,«,p.i„« 

properties  of  the  tun  media  are  the  ume(*21  »  |).  all  of  the  di-  medium. "  Cmh  Ljb.  Ter*  Rep  21.  H.rv*rd  Umv  .Cwnbndpc.  MA. 
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.  ...  „  ,  .  |17|  J  R  Wan.  Electromagnetic  fields  of  sources  in  kns>  media,  m 

uvrty  of  an  electric  or  a  magnetic  dipole  in  a  homogeneous  A„„„  Thtoey  „  E  CoIllB  tnd  f  ,  w> 
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Limitations  on  the  Size  of  Miniature 
Electric-Field  Probes 

GLENN  S.  SMITH,  senior  member,  ieee 


Btatalw*  dtpoh  pah*  k  •  oriid  tool  lor  ■ronriag  *a 
tfectrir  Arid  at  ll(hnBtaai  Buiaw  Imwada  A  mail  Mp 
for  ifec  yrob*  comma*  of  as  HrofertwBy  But  aaoa— 0  «Mi  a  Boft  ton 
in  tmainob;  a  trrisri-*.  pnRal »»«  tnromtaloa  Uaa  nvmmfei  At 
detected  Ugxri  from  Ifee  Agfa  lo  Ifec  teoniforiyt  Iwtroro caaaUoo  Soal 
dipoles  tee  OrurcMc  became  Ibejr  peoride  hick  apodal  rtaoluUoo  ol  Ifea 
field,  tal  became  (her  permit  a  fewpiency-iodepe adra  mpoaw  at  hiffeee 
■licxowire  (roqucnctes.  Rectal  efforts  bare  produced  probea  widi  dipole 
baR  leoftfea  *  Ira  dua  one  aiifliiweter.  With  the  advances  ocewiaf  In 
aaiciueiccnorirca  and  Ifeto-fibo  Mefewohgy.  Ifee  cooalmcdon  of  earn  writer 
probea  otajr  be  poanble. 

to  dm  paper,  tbe  Nanieriam  bopoaed  on  Ifee  xemMrify  of  dm  probe  bp  a 
reduction  in  ha  physical  rile  arc  determined.  A  aaodtl  that  comaina  none 
aowcca  for  the  diode  and  Ifee  reaiaiire  tramauvuoa  bae  h  mod  lo  obtain 
the  aipnal-io-ooiae  ratio  lor  the  probe,  and  dda  b  examined  aa  a  liovtion  of 
the  paraaetera  tfeal  describe  the  dipole,  diode,  reatadsc  transmission  Kao, 
and  amplifier.  When  the  physical  dimensions  of  Ifee  probe  are  reduced  bp 
the  scale  factor  *,(»,<  I),  the  rigori-to-notec  ratio  b  found  la  decrease  by 
approximately  tbe  factor  A/,  and  ifee  minimum  delectable  incident  electric 
field  for  a  fixed  sifnal-to-noise  ratio  b  found  to  increase  bp  approximately 
the  factor  k,~\  A  numerical  estimate  b  nude  lor  Ifec  acasidetty  of 
minianu*  probes  with  dipole  hall  lengths  in  the  range  IB  pm  <  fe  A  I  cm. 

I.  Introduction 

A  DIPOLE  ANTENNA  that  is  electrically  and  physi¬ 
cally  small  is  a  useful  probe  for  measuring  electric 
Helds  of  unknown  strength.  The  current  interest  in  the 
biological  applications  and  the  possible  health  hazards  of 
nonionizing  electromagnetic  radiation  has  led  to  the  devel¬ 
opment  of  miniature  dipole  probes  for  use  in  monitoring 
Helds  both  in  free  space  and  in  material  media.  The 
physical  size  of  the  miniature  field  probe  has  been  continu¬ 
ously  reduced.  Operational  probes  with  dipole  half  lengths 
A  less  than  0.8  mm  have  been  developed  by  the  U.S. 
Bureau  of  Radiological  Health  and  by  its  contractors 
(1 J-13J.  and  experimental  probes  with  h  as  small  as  0.3  mm 
have  been  produced  at  the  University  of  Virginia.  Char¬ 
lottesville  (4|.  With  the  advances  occuring  in  microelectron¬ 
ics  and  thin-film  technology,  the  construction  of  even 
smaller  probes  may  be  possible.  The  subject  of  this  paper  is 
the  limitations  imposed  on  the  response  of  these  probes  by 
a  decrease  in  their  physical  size. 

A  schematic  drawing  of  a  typical  dipole  receiving  probe 
is  shown  in  Fig.  1.  The  operat .  u  of  this  probe  is  fairly 
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simple.  For  an  amplitude-modulated  incident  Held,  the 
dipole  produces  an  amplitude-modulated  oscillating  volt¬ 
age  across  the  diode  at  its  terminals.  When  the  diode  is 
operating  in  its  square-law  region,  a  current  proportional 
to  the  square  of  the  modulating  signal  is  also  developed  at 
the  diode.  For  example,  a  continuous-wave  field  produces 
a  direct  current  at  the  diode.  This  current  is  passed  through 
the  low-pass  filter  formed  by  the  lossy  transmission  line  to 
the  monitoring  amplifier.  Thus,  a  signal  proportional  to  the 
square  of  the  amplitude  modulation  on  the  incident  field  is 
measured.  The  high-resistance  per-unit-length  of  the  lossy 
transmission  line  reduces  the  signal  received  directly  by  the 
line  and  transmitted  to  the  diode;  it  also  reduces  the 
scattering  of  the  incident  field  by  the  ir.-nsmission  line.  A 
transmission  line  formed  from  two  different  sections  is 
shown  in  Fig.  1;  the  section  nearest  the  dipole,  tine  I.  has 
(be  highest  resistance  per  unit  length. 

Fig.  2  shows  a  typical  construction  for  the  miniature 
e'ectric-field  probe.  'Pie  conductors  of  the  resistive  trans¬ 
mission  lines  and  the  discrete  resistors  are  formed  by 
depositing  thin  metrlhc  films  on  a  dielectric  substrate.  The 
diode  is  usually  an  unbiased  Schottky  barrier  diode  of 
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Fig.  i.  tqrnnkml  dtcuit  tot  probe 

beam-lead  construction  with  the  kadi  forming  all  or  part 
of  the  dipoU  antenna. 

II.  Dftecttd  Sional 

In  the  model  for  the  miniature  probe  ihown  in  Fig.  1. 
the  dincie  and  the  conductors  of  the  two  transmission  lines 
arc  formed  from  flat  strips  with  the  widths  wA,  wu,  and 
wL2,  respectively.  The  Keif  length  of  the  dipole  is  A,  and 
the  lengths  and  spacings  of  the  transmission  line  conduc¬ 
tors  are  rt,  r,  and  h,,  b2.  The  resistivities  and  thicknesses 
of  the  thin  films  forming  the  conductors  of  the  transmis¬ 
sion  lines  are  adjusted  to  produce  the  resistances  per  unit 
length  rj  and  rj.'  The  capacitances  per  unit  length  of  the 
lines  are  c,  and  c3. 

The  response  of  the  miniature  probe  is  easily  determined 
from  the  equivalent  circuit  shown  in  Fig.  3.  Detailed 
discussions  of  this  circuit  are  given  in  (S,  ch.  3]  and  in  (6); 
the  analysis  of  the  circuit  will  only  be  summarized  here. 

In  the  high-frequency  portion  of  the  equivalent  circuit, 
the  dipole  antenna  is  represented  by  its  Tbdvenin  equiv¬ 
alent.  The  open-rreuit  voltage  at  the  terminals  of  the 
clectrically-sbon  receiving  dipole  is  approximately  pro¬ 
portional  to  the  component  of  the  incident  electric  field 
parallel  to  the  axis  of  the  dipole  (z  axis) 

0) 

and  the  input  impedance  of  the  clectrically-short  dipole  is 
approximately  capacitive1 

ZA--j/aCA  (2a) 

where 

CA-^„h/[]o(4h/wA)-\].  (2b) 

The  effective  relative  permittivity  *„  is  included  in  (2b)  to 
account  for  the  dipole  being  on  a  dielectric  substrate. 

The  high-frequency  circuit  for  the  diode  is  the  junction 
impedance  Rf  in  parallel  with  Cr  in  series  with  the  resistor 
R,.  The  resistor  R,  is  small  (typically  R,  -  5  to  25  0),  and 
will  be  omitted  in  the  following  analysis  where  it  is  as¬ 
sumed  that  Rj  »  R,  and  uC,R,  « 1. 

The  complex  wave  number  for  the  highly-resistive  trans¬ 
mission  line,  line  1,  is  approximately  ku  ■  /ur,'c,  (I  -  j). 
The  parameters,  viz,  rj,  et,  and  tt,  of  this  line  are  chosen 
so  that  the  transfer  function  r  for  a  wave  propagating  over 

'TV  thkkntuo  of  the  thia  fitmi  ere  utuaed  to  be  imsB  compered 
with  the  ituo  depths  in  the  ftaitivt  matenelj  el  the  tnqueaoa  of 
lateral.  to  thet  the  mutaaoa  per  unit  length  r‘  are  ipproeimelrty 
frequency  independent. 

'Here  the  thin  ttrip  of  width  w4  it  euwmrd  to  be  approximately 
aqut-elent  to  t  circular  conductor  of  radiut  «  —  w4/4. 


the  line  is  smell  at  the  frequence  of  intercat,  Le. 

r-k'/*“*‘l«I.  (3a) 

The  input  impedance  to  bansmission  line  1  is  then  ap¬ 
proximately  us  characteristic  impedance  Zrt  (7] 

Z«,  -  ♦  JK rt  -  ^r/«f.  (l  -  j).  (3b) 

This  impedance  in  series  with  the  resistance  2  appears 
across  the  diode.  The  discrete  resistors  K,  are  included  to 
keep  the  transmission  line  from  presenting  a  low  imped¬ 
ance  across  the  diode  at  high  frequencies. 

In  the  low-frequency  portion  of  the  equivalent  circuit, 
the  diode  is  modeled  by  the  voltage  source  VA  in  series  with 
the  video  resistance  R, 

v*-  y>r  (<») 

where  y#  is  the  voltage  sensitivity  and  R  is  the  time 
average  of  the  high-frequency  power  absorbed  by  the  junc¬ 
tion  resistance  Rt  of  the  diode.  Note  that  y0  is  the  voltage 
sensitivity  of  the  diode  junction;  it  is  not  to  be  confused 
with  the  voltage  sensitivity  after  compensation  for  the 
effects  of  junction  capacitance,  load  resistance,  and  reflec¬ 
tion  loss  (8).  The  latter  is  sometimes  reported  in  manufac¬ 
turers’  specifications.  The  current  sensitivity  is 

A  “To/*,-  (4b) 

For  an  ideal  diode  at  a  temperature  of  290  K,  fa  ■  20 
A/W. 

The  low-frequency  model  for  the  two  resistive  transmis¬ 
sion  lines  in  scries  is  the  “Pi"  equivalent  network  with  the 
elements 

Rt~2('fa  +  4*i)  (5») 


^  MjI.  .  F,'s,(r,S|  +  e,*,) 

This  network  is  obtained  by  combining  two  “Pi"  networks, 
one  representing  each  of  the  transmission  lines,  and  drop-, 
ping  terms  of  order  wr,'r,rjjj(c,r,  +  c2t2)/(rjit  +  rjs2). 
When  a  single  resistive  transmission  line  is  used,  the  ele¬ 
ments  in  the  network  are 

RL-2r't,  Cu-Cm-cs/2.  (6) 

Note  that  the  “Pi”  network  is  a  low-pass  filter.  When  it  is 
driven  by  an  ideal  voltage  source  and  terminated  in  an 
open  circuit,  the  3-dB  cutoff  frequency  is 

uL  “  2t/i  “  (HgCgj) 

-  +2ci'i)+riVi*jrl  (7) 

The  low-frequency  amplifier  is  assumed  to  be  an  ideal 
noiseless  amplifier  with  gain  G,  a  "  bnckwall"  passband  of 
bandwidth  &u  -  2*&/,  end  an  infinite  input  impedance.1 

'TV  eapadianw  si  IV  iaput  to  IV  amplifier  esa  V  ioduded  ia  tV 
anaiytu  by  amply  *ddn|  it  to  (be  capacitor  CLt 
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The  assumption  of  an  ideal  amplifier  amplifies  the  analysis 
and  permits  a  discussion  of  the  probe's  characteristics,  such 
as  its  signal-to-noise  ratio,  independent  of  the  particular 
amplifier  used. 

For  a  continuous-wave  incident  field  (unmodulated  sig¬ 
nal).  the  detected  signal  |KJ  at  the  terminals  of  the  ampli¬ 
fier.  as  determined  from  the  equivalent  circuit  in  Fig.  3,  is 


a  on  mcsowavi  doom  smd  tsomoum,  vot  wtt-K.  ho.  4,  kins  ism 

where  the  fact  that  the  spectra  of  the  three  noise  sources 
(10),  (11),  and  (13)  are  approximately  frequency  indepen¬ 
dent  has  been  used.  The  squares  of  the  magnitudes  of  the 
voltage  transfer  functions  Ha(u)  and  u)  are 

[ ( «V<*twa  )* + ( )  /  + 1] " '  (13) 

l#a(*»)l1  “  ( 1  +  ( u/u* )’]  /( ( )J 

+  («V«i“a)^  +  lJ  (16) 


u,-((A.-s2A,)Curl 


2/*y{(i  +  Ay(2R,  +  *,i)/P*i  +  2„i,j,+  [u/U(-H;Jf,l/|21ll  +  Z,,!1!2} 


««“l^(C,a+C,)j'1.  (9) 

III.  Noise  Analysis 

The  noise  votuge  at  the  terminals  of  the  amplifier  is 
obtained  from  the  noise  equivalent  circuit  shown  in  Fig.  4. 
Each  of  the  noise-voltage  sources  c„(r)  in  the  circuit  is 
associated  with  a  time-average,  one-sided,  (voluge)  power- 
density  spectrum  /*„(/).  The  noise  power-density  spectra 
for  the  two  thermal -noise  sources,  the  resistances  2  A,  and 
rtt.are 

'.«,(/) -«n2*,)  (10) 

r.*df)-*kTRL  (11) 

where  Boltzmann's  constant  k  •OSxlO"**  J/K.  and  T 
is  the  temperature  in  degrees  Kelvin  (9],  (10). 

The  noise  power-density  spectrum  for  the  diode  is  ap¬ 
proximately  [U]-(13| 

r.oif) -4WK.UI  +  /v//v)  (12) 

where  r„  is  the  “white  noise  temperature  ratio."  fy  the 
video  frequency,  and  the  term  /*//*  accounts  for  the  "1// 
noise"  or  “flicker  noise"  of  the  diode.  The  diode  in  the 
miniature  probe  is  essentially  unbiased.  No  external  bias  is 
applied  to  the  diode,  and  the  self  bias  is  very  small  due  to 
the  large  series  resistance  in  the  low-frequency  circuit  For 
an  unbiased  diode,  r.  « 1  and  /*  *  0;  thcr.  fore,  the  noise 
power-d.-nsity  spectrum  of  the  diode  is  approximately 

Pmo(f)m  kkTR,.  (13) 

The  mean-squared,  noise  voltage  at  the  oulput  of  the 
amplifier  is 

-  Cl[(  #■.»  +  df 

+  *.iuJPK(/)l1<f/]  (14) 


*  -  K/«a)(l  +  Cu/Cu)*+  w./uj,  +  2 CLJ/CU. 


The  integrals  in  (14)  can  be  evaluated  in  closed  form 
[14];  after  considerable  reduction,  one  obtains  the  root- 
mean-square  (RMS)  noise  voltage  at  the  output  of  the 
amplifier 


^tan‘'(A«/7«i“*/) 


■{2R,+  R, 

♦*t[  l-(«t/«a)/l) 

-  l2B~'(bu/JuLvtf) 

•{2R,  +  R,  +  Rt(l  («t/u,)/)}j] 


(19) 

X-Vxl-  4' 

(20a) 

/-(X-X)/2 

(20b) 

,-(/  +  4)/2 

(20c) 

and  it  is  assumed  that  X1  >  0 
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Urge  compared  with  the  diode  impedance,  Le,  pA,  +  Zel | 
>■  A,.  This  prevent*  the  impedance  from  “loading  down” 
the  diode,  in  addition,  the  junction  romance  and  the 
video  resistance  of  the  diode  are  taken  to  be  approximately 
equal,  R,  »  R,.  The  voltage  sensitivity  of  the  diode  ys  can 
be  expressed  in  terms  of  the  current  sensitivity  (4b).  y0  - 
fi0R,mfi0RJ.  With  these  assumptions,  the  detected  signal 
(8)  becomes 


IV.  Signal-to-Noise  Ratio 

With  the  expressions  for  the  detected  signal  (8)  and  the 
RMS  noise  voltage  (19)  available,  the  signal-to-noise  (S/N ) 
at  the  output  of  the  amplifier  is  determined 


If  u1 »  ul  the  response  of  the  probe  is  independent  of 
frequency,  and  (22)  *s  simplified 


S/N-  (21) 

This  expression  is  easily  inverted  to  find  the  minimum 
incident  electric  Held  |£,'|  that  can  be  detected  for  a  given 
signal-to-noise  ratio. 

To  test  the  theory,  a  numerical  calculation  was  made  for 
a  miniature  probe  for  which  experimental  data  are  avail¬ 
able.  The  probe  considered  waj  developed  by  the  U.S. 
Bureau  of  Radiological  Health  and  the  Narda  Microwave 
Corporation  (BRH  Model  10.  Narda  Model  25256):  it  has 
a  dipole  half  length  A  -  0.75  mm:  approximate  values  for 
the  other  parameters  that  describe  the  probe  are  listed  in 
Table  1. 

In  measurements  made  by  the  Bureau  of  Radiological 
Health,  a  signal-to-noise  ratio  of  10  for  a  1-Hz  detector 
bandwidth  (A/  - 1  Hz)  was  obtained  with  this  probe  in  an 
incident  plane-wave  field  at  the  frequency  / "  2.45  GHz 
and  at  a  power  density  of  0.05  mW/cra2  (peak  incident 
elcctiic  field  |£,'|  - 19.4  V/m)  |3).  A  similar  probe  (Narda 
Model  2608)  was  tested  at  the  University  of  Ottawa;  their 
measurements  at  the  frequency  /-1.0  GHz  show  that  a 
peak  incident  electric  field  |£,‘|  “  7.7  V/m  is  required  for 
the  same  signal-to-noise  ratio  and  detector  bandwidth  (15). 
A  theoretical  calculation  made  by  using  (8),  (19),  and  (21) 


1^1 -Cft 


Vie;!’ 
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The  total  resistance  of  the  transmission  lines  R  L  -  2(  r,'*, 
+  rjj2)  is  chosen  to  be  much  greater  than  the  resistance 
2A,  +  At,  i.e.,  Rl  »  2A,  +  A,,  making  wt/w,«l  for 
Qt  and  CLi  of  comparable  value:  With  this  assumption, 
the  noise  voltage  (19)  becomes 

<u2>1/J-Gy/-^tan',(Aw/“z) .  (24) 

Simplified  approximate  expressions  for  the  signal-to- 
noise  ratio  and  the  minimum-detectable  incident  electric 
field  for  a  fixed  S/S  result  from  the  use  of  (23)  and  (24)  in 
(21) 

,B) 


|£;|*  l2(S/JV)^^tan-(A«/«i)/A,A*j 


indicate*  that  an  incident  electric  field  | £,’(  -  2.5  V/m  can 
be  detected  under  the  same  conditions.  The  agreement 
between  the  measured  and  the  calculated  incident  fields 
(they  differ  by  factors  of  7.8  and  3.1)  is  surprisingly  good, 
considering  that  the  parameters  for  the  diode  are  only 
typical  values  for  the  type  of  diode  used  and  that  no 
account  was  taken  of  the  noise  in  the  amplifiers. 

V.  Sensitivity  Versus  Probe  Size 

One  objective  of  this  study  is  to  determine  the  signal-to- 
noise  ratio  and  the  minimum  incident  electric  field  |£,'| 
that  can  be  detected  for  a  given  signal-to-noise  ratio  as  the 
physical  size  of  the  probe  is  decreased.  The  expressions  (8) 
and  (19)  for  the  detected  signal  and  the  noise  voltage  are 
loo  complex  in  their  present  form  to  extract  any  general 
dependence  of  the  sensitivity  on  the  parameters  that  de¬ 
scribe  the  probe.  The  complexity  of  these  expressions, 
however,  can  be  greatly  reduced  by  making  a  few  simple 
assumptions. 

The  impedance  2  R,  +  that  shunts  the  diode  in  the 
high-frequency  equivalent  circuit.  Fig.  3,  is  chosen  to  be 


(1  +  C^/Cj.  (26) 

Note  that  the  inequalities  used  in  obtaining  (22)  and  (24). 
pA,  +  Z„| »  A,  and  Rt  »  2A,  +  A„  can  be  satisfied  by 
choosing  a  diode  with  a  suitably  low  junction  or  video 
resistance,  since  2  A,  is  of  the  order  of  Rf  or  A,.  These 
inequalities,  however,  are  not  the  only  conditions  that  must 
be  considered  when  choosing  Rr  The  junction  resistance 
also  enters  the  expression  for  the  frequency  *>,  (9)  which  is 
the  lower  bound  for  the  frequency-independent  response  of 
the  probe.  A  discussion  of  this  phenomenon  is  in  |6|. 

It  is  interesting  to  examine  the  expression  for  the  noise 
voltage  (24)  for  two  limiting  cases,  i)  the  bandwidth  of  the 
amplifier  equal  to  the  3-dB  cutoff  frequency  of  the  trans¬ 
mission  lines.  Au/dj  "1,  and  ii)  the  bandwidth  of  the 
amplifier  much  less  than  the  3-dB  cutoff  frequency  of  the 
transmission  lines,  bu/ul  cl.  In  the  first  case.  (24)  be¬ 
comes 


(27) 


m 
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When  the  lews  lance  of  (he  first  transmission  lire  is  much 
peater  then  that  of  the  secood  (r{si/r{/t1  I),  the 

capacitance  Ci2  (5c)  end  the  noise  voltage  (27)  we  nearly 
ir  dependent  of  the  resistance  of  the  transmission  lines  RL. 
This  is  the  result  of  the  noise  power-density  spectrum  P,tl 
of  the  transmission  lines  being  proportional  to  RL  and  the 
bandwidth  of  the  amplifier  Aw  Doing  proportional  to  Ril. 
which  makes  the  product  Rl6mi  independent  of  RL.  In  the 
second  case.  (24)  becomes 

(vl)'»-GJ*kTRL*f.  (28) 

This  is  just  the  noise  voltage  produced  by  the  resistance  of 
the  un.ii mission  lines  RL  in  the  band»«ith  A/- Au/2*. 
In  both  of  these  cases,  the  expression  for  the  minimum- 
detectable  electric  field  for  a  fixed  S/N  (26)  involves  cf 
and  only  two  parameters  that  describe  the  probe:  the  half 
length  of  the  dipole  h,  and  the  transmission-line  capaci¬ 
tance  Qj  (case  i)  or  the  transmission-line  resistance  RL 
(case  u).  Of  these  parameters,  a  variation  in  A  has  the 
greatest  effect  on  |£t‘|.  since  it  enters  the  expression  as  A'* 
when  Cj  ■«  CA  or  as  A'1  when  C;  »  CA,  whereas  the  other 
parameters  enter  the  expression  as  C/j1 /4  and  ft1/4. 

The  highly  resistive  transmission  line,  line  1,  must  be 
designed  to  not  interfere  with  the  reception  of  the  incident 
field  by  the  dipole  antenna.4  .This  is  accomplished  by 
making  the  transfer  function  for  a  wave  propagating  over 
the  line 

T  “exp(-  ywijc^r,)  (29) 

small,  as  in  (3a),  and  by  making  the  reception  of  the 
incident  field  by  the  transmissior  line  negligible.  The  ratio 
of  the  signal  received  by  the  transmission  line  to  the  signal 
received  by  the  dipole  proportional  to 

X|Zd/(ZtI+2*,)|  (30a) 

with 

x.  IM4*/*,)-1!  (fc|/A)(fo/2,;A)  (30b) 

and  f#  equal  to  the  impedance  of  free  space  |7],  The 

reception  by  (he  transmission  line  is  negligible  when  the 
dimensionless  parameter  x  “  small.  i.e.,  x  1-  With  t  and 
X  specified.  (29)  and  (30b)  can  be  rewritten  to  obtain 
expressions  for  the  resistance  per-unil-tcnglh  r{  and  the 
length  s,  of  line  1 

'J-MVA)(M4A/w,)-ll/2*Ax  (31) 

s,--ln(T)/^.  (32) 

Now  consider  a  reduction  in  the  size  of  the  probe  that 
leaves  the  performance  of  the  highly-resistjve  transmission 
line  approximately  unchanged,  i.e,  the  parameters  r  (29) 

*TV  uammmioa  line  muit  •Cto  be  designed  to  dial  die  iracvni  of 
eiKrjy  it  Kitten  ts  accrpubk  for  a  p  Articular  application.  FormuJu  for 
the  Kittmat  crow  seen*'  of  ft.  Une  we  10  |7J. 


and  x  (Kb)  unchanged.  The  dimensions  of  the  dipole  (A 
and  w4)  are  reduced  by  the  scale  factor  k,  (A,  <  1).  The 
widths  a  id  the  spucings  of  the  conductors  for  both  trans¬ 
mission  ! nes  ( wa,  wt,  and  A,.  A,)  are  also  reduced  by  the 
same  .calc  f'xtor.  The  capacitances  per-unit-lengtb  for 
both  of  Oe  transmission  lines  c,  and  fj  are  then  ncarty 
indepenue  t  c(  #,.* 

The  '^raining  parameters  for  line  1.  r[  and  s,.  are 
detmi*il  irom  (31)  and  (32)  once  the  constants  r  and  x 
are  sp'-ofied.  Note  that  the  resistance  per  unit  length  r{ 
must  be  increased  as  A,'1  and  the  length  a,  decreased  as 
A}''2.  The  scaling  for  r,‘  can  be  accomplished  by  holding 
fixed  the  thickness  iLl  of  the  resistive  film  forming  the 
conductors  of  the  line  as  the  size  of  the  probe  is  reduced 

(r{  *  1/ *  Aj). 

The  length  of  line  2  is  held  fixed,  since  it  determines  the 
spacing  between  the  dipole  and  the  ins'rumentation,  and 
the  rests tince  of  line  2  is  assumed  to  be  much  smaller  >h;ji 
the  resistance  of  line  1,  r{si  «  r/r,.  This  makes  th.-  capaci¬ 
tors  CLX  and  CtJ,  (5b)  and  (Sc),  and  the  cutoff  frequency 
uL  (7)  in  the  equivalent  circuit  for  the  transmission  lines 
nearly  independent  of  ij'jj: 

*  c^i/2 

"t',|^1i(ci*i+2‘2»j))*1- 

With  the  scaling  described  above,  the  dependence  of  the 
signal-to-noise  ratio  (2S)  on  the  scale  factor  k,  is  easily 
determined.  Usually,  Ct »  CA  for  very  short  dipoles,  mak¬ 
ing  the  numerator  of  (25)  approximately  proportional  to  A* 
or  A,4.  The  denominator  of  (25)  is  only  weakly  dependent 
on  A,;  (or  example,  when  kcut  the  denominator  is 
proportional  to  kfl/*.  Thus,  the  signal-to-noise  ratio  is 
seen  to  decrease  approximately  as  A,4.  The  same  argument 
shows  that  the  minimum-delectable  incident  electric  field 
)£T*|  for  a  fixed  S/N  (26)  increases  as  A,'1. 

In  Fig.  5,  the  minimum-detectable  incident  electric  fidd 
|£,'t  obtained  from  (26).  for  a  signal-to-noise  ratio  of  10  dB 
( S/N  -  3.16...)  is  shown  as  a  function  of  the  half  length 
of  the  dipole:  The  scaling  described  above  was  used  in 
preparing  this  graph.  The  parameters  chosen  for  the  dipole 
and  the  transmission  lines  are  h/wA  -  5.0.  c„  - 1.0.  A,/A 
-  0.2,  Cjj2  -10  pF;  those  for  the  diode  arc  C,-  0.1  pF, 
and  fio  -  20.0  A/W  (the  theoretical  value  for  an  ideal 
diode),  and  the  temperature  is  290  K.  fn  addition,  the 
dimensionless  parameters  t  (29;  and  x  (Kb)  are  assumed 
to  be  r  -  0.01  at  the  frequency  /- 100  MHz  and  x  “  0.01. 
Results  are  shown  for  the  Bandwidth  of  the  amplifier  equal 
to  1  Hz  and  equal  to  the  3-dB  cutoff  frequency  of  the 
transmission  lines.  Curves  are  presented  for  typical  values 
of  the  capacitance  per  unit  length  of  transmission  line  1, 

’The  Ihictnnan  of  the  thin -film  conductors  arc  a-sumed  I  null  com- 
pued  wuh  their  widths,  And  the  ihkknfM  of  the  dielectric  lutatrate  ts 
Assumed  Uife  compared  with  the  duoenuoas  of  the  trAfuauuKXh-lixie 
cross  sections. 
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Fit  5.  The  minimum-detectable  peak  electric  field  |£*t  for  a  10-dB 
ugnalto- noise  rauo  venus  the  dipok  half  kn|th  A  The  parameters  for 
the  theoretical  calculations  arc  given  in  the  teat  The  measured  dau  are 
for:  a)  BRH  Model  10.  Batten  and  Franke  (Jf.  /•  2 <5  GHz.  b)  Narda 
Model  2601.  Stuchlv.  eta!  (15).  / - 1  0  GHz.  c)  KIT  Model  979. Stuchly. 
rt  a I.  f  151  /•!  0  GHr.  d>  Holaday  Model  IMF -01.  Stuchly.  et  al  JJ5J. 
/-lb^rt*. 

c,  -  20  pF/m  an d  200  pF/m.  The  parameters  used  in  Fig. 
5  for  the  transmission  lines  are  only  typical  values;  changes 
in  these  parameters  will  affect  the  calculated  values  for 
|£,'l  However,  the  variation  in  |£;|  with  these  parameters 
will  be  fairly  slow,  since  they  enter  the  expression  for  |£,'| 
(26)  as  arguments  of  a  fourth  root. 

From  Fig.  S.  it  is  clear  that  the  price  paid  for  a  decrease 
in  (he  size  of  the  miniature  field  probe  is  a  drastic  decrease 
in  the  sensitivity.  Even  for  a  1-Hz  detection  bandwidth  and 
only  a  10-dB  signal-to-noise  ratio,  the  half  length  of  the 
dipole  must  be  greater  than  about  1  mm  to  measure  a  peak 
electric  field  of  1  V/m.  With  the  detection  bandwidth 
equal  to  the  3-dB  cutoff  frequency  of  the  transmission 
lines,  the  half  length  of  the  dipole  would  have  to  be  greater 
than  about  0.S  cm  to  measure  the  same  electric  field. 

Of  course,  a  decrease  ip  the  length  of  the  dipole  has  the 
advantageous  effect  of  increasing  the  maximum  electric 
field  that  can  be  measured  with  the  probe,  i.c.,  a  larger 
electric  field  can  be  measured  before  the  voltage  across  the 
junction  of  the  diode  is  sufficient  to  cause  a  departure  from 
square-law  response.  However,  there  are  other  preferred 
methods  for  correcting  for  non-square-law  respoi.se  that  do 
not  affect  the  sensitivity  of  the  probe,  such  as  the  use  of 
shaping  circuitry  in  the  monitoring  instrumentation  |16J. 

The  measured  results  of  other  investigators  for  probes 
with  half  lengths  in  the  range  0.75  mm  «  h  <  8.9  mm  are 
also  shown  in  Fig.  5  (3),  (15|.  In  all  cases,  the  measured 
data  have  been  converted  to  give  the  minimum-detectable, 
peak  electric  field  for  a  10-dB  signal-to-noise  ratio  with  a 
1-Hz  bandwidth.  The  measured  points  are  seen  to  follow 
the  trend  of  the  theoretical  estimate,  but  they  are  higher  by 
factors  of  three  to  twenty.  This  is  to  be  expected,  since 
some  of  the  parameters  for  these  pubes  are  quite  different 
from  those  used  in  the  theoretical  calculations.  The  theoret¬ 
ical  curves  can  be  considered  as  reasonable  estimates  of  the 


sensitivity  to  be  expected  from  miniature  electric-field 
probes  of  the  design  shown  in  Fig.  1  and  with  the  specified 
-/•lues  of  r  and  x- 

Currently,  the  beam-lead  diodes  available  commercially 
have  a  junction  size  of  the  order  of  100  pm.  Probes 
constructed  using  these  diodes,  c+en  with  some  modifica¬ 
tion  to  the  diode,  are  limited  to  dipole  half  lengths  greater 
than  about  h  —  0.3  mm  (4j  New  diodes  would  have  to  be 
fabricated  before  probes  with  smaller  dipoles  could  be 
constructed. 

VI.  Conclusion 

The  miniature  electric-field  probe  with  the  construction 
shown  in  Fig.  1  was  analyzed  to  determine  its  signal-to- 
noise  ratio  and  the  minimum-detectable  incident  electric 
field  for  a  fixed  signal-to-noise  ratio.  t\  method  for  scaling 
the  physical  dimensions  of  the  elements  in  the  probe  was 
presented,  and  the  variation  in  the  sensitivity  of  the  probe 
with  a  decrease  in  its  physical  size  by  the  factor  k,  (k,  <  1) 
was  examined.  The  signal-to-noise  ratio  for  the  probe  was 
found  to  decrease  approximately  os  kf,  and  the  minimum- 
detectable  incident  electric  field  for  a  fixed  signal-to-noise 
ratio  was  found  to  increase  approximately  as  k-  3.  The 
formulas  and  numerical  results  presented  should  be  helpful 
in  the  design  of  future  miniature  electric-field  probes. 
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Calculation  of  TM0n  Dispersion  Relations  in 
a  Corrugated  Cylindrical  Waveguide 

ALAN  BROMBORSKY,  member,  ieee,  and  BRIAN  RUTH 


Atetroct —Tb*  TM^  oodr  MtiwrB  rqmftoo*.  in  »  cvtindrfc*)  gfotw- 
try  Hv  converted  to  •  stilMKlnr  \yslttm  of  coupfed  linnr  tfiffcrc*ri«l 
equation,  in  nliick  tbc  boundary  conditiom  for  •  ol  ruym* 

diameter  are  included  In  tW  coefficient  matrix  of  the  itnir- teeter  sytfem. 
The  particular  problem  ol  periodic  boundary  conditions  b  aohed  for  a 
ntr^udr  with  a  ilmaoidaiy  undulating  waR. 

I.  Introduction 

HE  GENERATION  of  ulira-high-power  (-1  GW) 
microwave  pulses,  via  the  driving  of  slow-wave  struc¬ 
tures  by  intense,  pulsed,  relativistic  c'eciron  beams  (0.5  to 
-.0  MeV,  2  lo  15  UA,  15  to  100  ns)  [1],  |2|.  places  unique 
demands  upon  the  slow-wave  stra  ,ure  in  terms  of  Ihe  RF 
power  densities  (0.3  GW/cra2)  and  electric  fields  (400 
kV/cm)  present  in  the  struct  :.  Conventional  slow-wave 
structures,  such  as  the  heirs-  and  iris-loaded  waveguides, 
are  susceptible  to  high-field  breakdown,  and  hence  plasma 
fornatioa.  with  the  subsequent  shotting  out  of  the  slow- 
wive  structure.  What  is  required  for  ultra-high-power  de¬ 
vices  is  a  structure  with  a  periodic  wall  shape  lhat  does  not 
lead  to  undue  electric-field  intensification.  A  po'sible 
candidate  is  a  cylindrical  guide  in  which  the  waveguide 
diameter  vanes  sinusoidally  with  axial  position. 

However,  in  order  to  design  a  device  utilizing  such  a 
structure,  the  cold  waveguide  dispersion  relation  and  the 

Manuscript  received  Ausuit  31, 194).  revive  February  6. 1964. 

The  authors  sre  with  the  rvyarutiriil  ol  the  Army,  Hairy  Diamond 
Laboratories,  Adelphi.  MD  20713. 


electromagnetic  field  distribution  must  be  accurately  de¬ 
termined. 

The  basic  objective  of  this  paper  is  to  describe  a  tech¬ 
nique  for  computing  the  dispersion  relation  and  elc.tro- 
magnetic  fields  of  the  TM„.  modes  of  a  periodically  rip¬ 
pled  cylindrical  waveguide.  Please  note  lhat  the  technique 
to  be  described  also  can  be  applied  to  other  than  TM0. 
modes,  so  lhat  with  minor  changes  the  derivation  could  be 
quite  useful  in  calculating  TE  modes  in  tapered  gyrotron 
cavities  Also  note  that  the  source  terms  in  the  Maxwell 
equations  are  not  initially  set  to  zero.  This  is  done  so  that 
eventually  the  field  calculation  described  can  be  used  to 
compute  the  coupling  impedance  between  an  electron  beam 
and  a  propagating  waveguide  mode. 

II.  Scauno  of  Maxwell  Equations 


A.  Notation 

We  define  (in  MKS  units) 

r  free-space  speed  of  light, 

<0  permittivity  of  free  space. 

Mo  permeability  of  free  space, 

t)o  free-space  wave  impedance  (377  0), 

w  wave  circular  frequency, 

r,  8, :  cylindrical  coordinates, 

£„  £*,  E,  electnc-field  components, 

H„  Hf.  E,  magnetic- field  components, 

J„  J,.  J,  current-density  components, 

L  periodicity  length  of  slow-wave  structure. 
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Tbo  tingle-tar*  loop  antenna  ia  >  metallic  conductor  bent  into  the  ihapc  of  a  rioted 
cam,  net  aa  a  cirri*  or  a  tquarc,  with  a  gap  ia  the  conductor  to  form  the  ttrmiaal*. 
A  muhitura  loop  cr  cnl  ia  a  acriaa  ooantctioa  of  overlaying  lama.  The  loop  •  one  o( 
the  primary  aateaaa  atruetaraa;  iu  aw  B>  a  receiving  antenna  da  tea  bach  la  the  catty 
experiment!  of  Hertz  oe  the  propagation  a f  electromagnetic  wave*.1 

The  diacuaaioe  of  loop  antennal  ia  conveniently  divided  according  to  electrical 
aize.  Electrically  imal)  kept,  thoec  whoae  total  conductor  length  it  imtli  compared 
with  the  wavelength  ia  free  (pace,  ait  the  moat  frequently  encountered  in  practice. 
For  example,  they  art  commonly  tied  aa  receiving  antennal  with  portable  rxdtoe.  aa 
directional  antennal  for  radio-wave  navigation,  and  aa  probea  with  6eld-«ttenjph 
metera.  Electrically  larger  loopt.  particularly  tboac  near  reaonant  lira  (circumference 
of  loop/wavtkngtb  ra  I),  are  used  mainly  a*  ekinenu  in  directional  amyt. 

The  following  aymboti  arc  aaed  throughout  lb  chapter 

X  -  wavelength  in  free  apace  at  the  frequency  /  “  w/2t.  where  the  complex  har¬ 
monic  time-dependence  exp  (Ju l)  ia  tiaumed 
p  •  2t/X  •  propagation  cornual  in  free  apace 
f  -  VHjH  m  wave  impedance  of  free  apace  (•  3T7  Q) 
b  »  mean  rediua  of  a  circular  loop  or  mean  aide  length  of  a  iquarc  loop 
a  »  ndiua  of  loop  conductor  (AJ1  itaulta  preaented  art  for  thio-wire  loopt,  a/h  « 
D 

A  ■  area  of  loop 

N  m  Dumber  of  tanu 

t,  “  length  of  aoknoidal  coil 


5-2  ELECTRICALLY  SMALL  LOOPS _ 

The  axial  cantnt  diatribution  in  an  licctrically  amall  loop  ia  auumrd  to  be  aaiform: 
that  ia,  the  current  hit  the  umc  vatwi  /.  at  any  point  along  the  conductor.  Far  aingle- 
tura  loopt  aad  m'uliturn  loopt  that  at  tingle-layer  adenoidal  eoilt,  mcaaarcmcnta 
auggeat  that  thia  ia  a  good  aaaumptioe  provided  I  he  tout  length  of  the  conductor  {N 
X  circumference)  ia  amall  compared  with  the  wavelength  in  free  apace,  typically 
IS0.1X.  and  the  Icngib-todiamtler  ratio  for  the  aolenoidal  coil  ia  greater  thaa  about 
3  It, /2b  2;  3.0)'  With  a  uniform  current  auumed.  the  electrically  amall  loop  antenna 
ht  limply  analyzed  aa  a  radiating  inductor.' 


I* 


Tran  emitting  Loop 

The  electromagnetic  field  of  an  electrically  until  loop  antenna  ia  the  ■ 
t  magnetic  dipole  with  moment  m  »  ItNA: 


taa  that  of 


*♦“  *£7 


m 
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loop  Artanmt  M 


*'m:£r{i+??)''p'eo,t  <M> 

where  the  plane  of  the  loop  it  normal  to  the  polar  exit  of  the  spherical  coordinate 
eytum  (r.  t.  4)  centered  at  the  loop,  at  tbowo  in  F14.  3-1.  fa  the  far  tone  of  the  loop 
(iim  flr  —  ao),  only  the  leading  term*  in  Eqi  (3-1)  and  (3-2)  ait  significant,  and  the 


opAertcal  coordmet*  ayntam. 


PM2.  M  Far-tona  aarticaHtono  (aid  pat¬ 
tern  ot  an  electrically  anal  loop. 


field  pattern  for  both  E,  and  8,  in  the  vertical  plane  it  the  simple  figure  eight  ahowa 
in  Fig.  3-2. 

The  driving-point  voltage  and  current  arc  related  through  the  input  impedance 
of  the  loop.  V  ■  Z/*.  For  electrically  amall  loops,  the  impedance  it  the  aerie*  combi¬ 
nation  of  the  reactance  of  the  external  inductance  V  with  the  radiation  re* ounce  K 
and  the  internal  impedance  of  the  conductor  T  m  K  +  JuV: 


z -  nr  +  r  +  -  nr 4  K  +  /*£'♦  u\ 


<«> 


In  the  equivalent  circuit  for  the  amall  loop,  a  lumped  capeciuncc  C  d  aometimea 
placed  in  parallel  with  Z  to  account  for  the  diatribuled  capaciUncc  between  the  tidca 
of  a  tingle  turn  and  between  the  luma  of  a  adenoid,  at  ahown  in  Fig.  3-3.  Tbit  capac¬ 
itance  is  omitted  here,  since  in  practice  a  variable  capacitance  it  usually  placed  in 
parallel  with  the  loop  to  tune  out  its  inductance;  the  capaciUncc  of  the  loop  timply 
decreases  the  value  of  the  parallel  capacitance  needed.  Note  that  a  loop  with  a  truly 

uniform  currant  distribution  would  have 
no  capaciunce.  Since  from  the  equation 
of  continuity  there  would  be  ao  charge 
(li  along  the  cooduc-or  of  the  loop. 

The  radiation  resistance  of  the 
amall  loop  is  proportional  to  the  square  of 
£u>.n>  the  product  of  the  area  and  the  number 
of  luma: 


— » — 
i 

! 

I 

P*t. 

CT 

I 

I 

I 

— t— 


FIO.  M  Equivalent  era*  ior  Input  Im* 
ptdnoq  Z  of  an  tfactrtoa>y  wnal  loop. 


*  mh  ^SA)l 


(5-5) 


fr4  Typao  and  OeWQn  MaPiodi 


For  single-turn  !«V»  tod  soicnoidaj  coils  whose  turns  on  Ml  loo  closely  spneod.  the 
internal  impedance  it  approximately 

Z*  ■  s'  X  toul  length  of conductor  (5-6) 

■here  /  it  (he  internal  impedance  per  nail  lengih  of  a  straight  conductor  with  the 
Man  crou  icction  at  the  toon  cooductor.4  If  the  luma  of  the  coil  are  doocly  t paced, 
the  proximity  effect  moat  alto  he  included  in  detennininf  Z*.1 

The  external  inductance  it  determined  from  oat  of  the  many  formulas  available 
for  the  inductance  of  coils:4 


For  a  single-turn  circular  loop 


V  -  *4(ln  (14/a)  -  2) 

(8-7) 

and  for  a  single-turn  square  loop 

V  -  ^  (tn  (4/a)  -  0.774) 

(8-6) 

The  external  inductance  of  a  tightly  wound  single-layer  solenoida!  coil  of  length  t, 
and  a  radius  4  is  often  approximated  by  Lore  ns's  formula  for  the  inducunce  of  a 
circumferentially  directed  current  sheet.4  Numerical  results  from  this  formula  can  be 

pul  in  a  form  convenient  for  application: 

V  -  Wf-l/f, 

(5-9) 

•here  the  factor  K.  known  as  Nifsoks'i  consunt,  it  shown  as  a  function  of  the  ratio 
t,/2b  (lenfib  of  the  coil  to  the  diameter)  in  Fit-  5-4.  Note  that,  tor  a  Ions  coil  (f,/ 
2b  >  I),  K  «  1.  The  use  of  Eq.  (5-9)  assumes  that  the  turns  of  the  coil  arc  ao  closely 
spaced  that  the  winding  pitch  and  insulation  on  the  conductors  can  be  isnored;  if 
highly  accurate  calculations  of  V  arc  necessary,  corrections  for  these  factors  are  avail¬ 
able  in  the  literature  4 


Receiving  Loop 

When  the  electrically  snuff  loop  is  used  as  a  receiving  antenna,  the  voltage  developed 
at  iu  opencircuited  terminals  toe  ■>  proportional  to  the  component  of  the  incident 
magnetic  flux  density  normal  to  the  plane  of  the  loop  B1^ 

Vec  -  JoNAK  (8-10) 

where  the  incident  field  is  sisumcd  to  he  uniform  over  the  area  of  the  loop.  This  simple 
relation  between  hoc  and  Bf,  makes  the  small  loop  useful  u  a  probe  for  measuring  the 
mag-etic  flux  density.  If  t  rtlstion  between  the  incident  electric  snd  magnetic  fields 
at  the  center  of  the  loop  is  known,  For  can  bees  pressed  *»  terms  of  the  magnitude  of 
the  incident  electric  field  C  and  an  effective  height  A,  This  is  the  case  for  tn  inctdenl 
plant  wave  with  the  wave  vector  k,  and  the  orienution  shown  in  F if.  5-5: 

Ker  -  J*NAf  era  i,  sin  #,  -  kJUh  !,)£•  (8-11) 


where 


»J  -  Voc/e  -  »NA  coad,  sin  I, 


(8-12) 


WO.  M  Nagaofca'e  contram  (C  tor  ■  adenoidal  ool  aa  a  fietcOon  of 
Ow  eoa  tangm  to  me  diameter.  tjtb. 


V 


no.  H  Hbiiww  Md  Incttant  on  re-  HO.  M  Th4v*nin  wuliiUnl  cOcdt  lor 
otvtng  loop  the  reoeMng  leap. 


The  voltage  term  an  arbitrary  load  impedance  Zt  connected  to  tbe  termini  I*  of  the 
loop  with  input  impedance  Z  it  determined  from  the  TMvenin  equivalent  circuit  in 

r«.M: 


n  -  y«zj(z  +  Zi> 


(5*13) 


M  Typaaan«Oaal»itMadb 


Fwrtt*-Lo«d«d  (toc*Mng  Loop 

The  open-circuit  voltage  M  Ux  terminals  of  the  electrically  tmall  receiving  loop  can 
bo  increased  by  BUing  tho  loop  with  a  core  of  permeable  material,  uaally  a  ferrite. 
The  effect  of  the  coca  ie  to  inertaae  the  magnetic  flu  through  the  area  of  the  loop,  aa 
illustrated  in  Fig.  5-7  lor  a  adenoidal  coil  with  a  cylindrical  core  plaoed  to  a  uniform 
axial  magnetic  field. 

The  ferrite  material  ie  characterized  by  a  complex  relative  initial  permeability 
*  m  a/ae  m  K~  J*“  and  a  relative  permittivity  e,  «  e/e*.*  The  materia)  it  uaually 
•elected  to  have  a  loot  tangent  pm  »  «;/jd  which  it  email  at  tbe  frequency  of  opera- 
lioeL  and  consequently  p’  is  ignored  in  the  analysis  except  when  the  power  dissipated 
in  the  core  is  being  calculated  The  dimensions  of  the  core  ire  also  assumed  to  be  small 
compered  with  tbe  wavelength  in  the  ferrite  h.  *  X/  to  prevent  internal  reso¬ 
nances  within  tbe  core,’ 

The  open-circuit  voltage  for  a  single-turn  loop  at  the  middle  of  a  ferrite  cylinder 
of  length  /,  and  radios  5  is  increased  by  the  factor  sm  over  the  value  for  the  mine 
loop  in  free  space: 

Voc  ■  jvu^Af,  (**14) 


COit.  IN  INOOCNT  FIELD 


coil  with  rtnniTc  coke  in  incident  field 
PM.  h-7  Etact  of  a  cySndrteal  terms  owe  on  the  mag¬ 
netic  lux  through  a  aolenoWal  coi. 


•Tw  iartisl  pshnsskfUiy  is  Iks  Ssmslhu  iH*H  ie  Iks  Lair  ss  Mu  rWsesS  is  ma*  DwUcinc  kws 
la  Iks  fsmts  is  ipwvS  ksrt,  ssd  tks  gsrwnuviiy  is  sMSud  w  Ss  fuel. 


7-^? 


Here  the  radio*  of  Ik*  loop  ooodactor  « it  ignored,  tad  Ike  atcaa  radre*  of  tk*  loop 
tad  Ik*  con  on  tunned  lo  kt  Ike  it  me  «tluc  *.  The  graph  it  Fi|  H  itwi  tke 
ippticat  permeability  tcUl  ftaetioa  of  the  lengtbHodiametcr  rtuo  tor  the  rod 
t,/2b  witk  tk*  relative  iaititl  permeability  of  Ik*  ftrhu a,'  tt  t  parameter*  Suaiitr 
(npht  for  Ike  apparent  permeability  of  toiid  tad  koUo*  apbcroidal  earn  tie  ia  Ike 
kuntarc.* 

For  t  tiapMayer  toicaoidal  coil  of  Icagih  f,  centered  oa  tk*  rad,  aa  averaging 
factor  Ft  muet  be  included  ia  Ike  open-circuit  voltage  10  toooaal  for  Ike  docratu  ia 
lk«  flux  along  lb*  length  of  ike  toil  from  Ike  maxinam  tt  ike  middle: 

F*c  -J*+*FtNAV.  (5-15) 

The  empirical  factor  FV,  determined  from  an  t*ert|c  of  eipcrimcatal  retain,  it  thowa 
in  Fi|.  5-9  u  i  fuactioa  of  Ik*  ratio  /,//,  (length  of  ike  ooii  10  length  of  ike  rad).'*-" 
For  t  loop  rod  of  modtnu  permeability  (f,/2k  >  I.  mm  «  *C)  covered  by  a  coil  of 


z  «  a  •  2  «  a  a 

1  10  10* 


l,'2» 

HO.  *-•  The  apparent  permeability  i^atra  mton*  el  a 
cytndncal  rod  it  a  function  ol  m*  ItngtMomametar  mao 
t,!7b  with  the  tnrtlaJ  par-naaOtny  »;  a*  t  parameter 


M  Types  and  DawgnMrtiodi 


no.  t-%  The  factor*  F„  FL,  and  fm  as  functnna  of  *i* 
ratto/,'/,  (length  ot  me  cod  to  length  of  the  rad).  Thaaa 
(actor*  war*  dafarwtod  tram  average#  of  experimental 
data. 


equal  length  “  I),  the  open-circuit  voltage  is  increased  by  approranately  Ike 
factor  O.t  n  over  the  opeo-circuif  vofugc  for  the  tame  ml  without  the  cor. 

The  equivalent  circuit  for  the  impedance  of  the  ferrite-loaded  aokaaida!  coil  is 
that  in  Fig.  5-1  with  as  additional  aeries  resistor  *■  included  to  account  for  the  power 
dissipated  ia  the  core.  The  elements  i»  the  circuit  are: 

the  radiation  resistance 

(5-16) 

the  resistance  due  to  core  toss 

-  -OWnaVr-aFa/V'if/f,  (8-17) 

the  asternal  inductaaec  of  the  loaded  solcooidal  coil 

V  -  a, Jour  Alt,  (6-18) 

The  internal  impedance  of  the  conductor  Z*  is  assumed  to  be  Ihc  same  as  that  for  the 
■idooded  loop.  The  empirical  factor*  Ft  sad  Ft  in  Eqs.  (1-17)  and  (MIX  like  Ft 
were  determined  from  ao  average  of  experimental  results  cod  ere  also  shows  as  o 
function  of  the  ratio  /,/f,  ia  Fig.  H.1*"  It  should  be  emphasised  that  the  graphs  for 
the  three  factors  Ft  Ft,  tad  F t  represent  typical  measured  values  and  show  only  tbc 
dependence  on  the  ratio  /,//„  some  dependence  on  the  other  parameters  describing 
(be  coil  and  the  rod  *  to  be  taped ed. 

Equations  (1-15)  through  (1-11)  provide  t  complete  description  of  tbc  electri¬ 
cally  small  ferrite  loaded  receiving  loop  (single-layer  solenoids!  coil  with  ■  cylindrical 
con);  other  parameters  of  interest,  each  at  the  Q  tt  the  antenna,  can  he  determined 


IsVsM- 


n-5t 


from  tb«M  results.  TW  pcnnmbsiily  of  t  ipecific  fcrritt  cu  bo  obtained  from  the 
Biulicintr  or  from  tbc  cxunaivu  rabies  tod  cboru  ia  Rtf.  It.  The  many  parame- 
tt.t  that  on  10  bo  cbotco  for  the  fcnitoJoaded  loop,  tack  a s  «C  /„  f„  AT,  tie.,  offer  a 
gnat  deal  of  flexibility  ia  hi  design.  Then  an  several  discussions  ia  tbc  literatim  that 
determine  these  parameters  to  optimize  the  performance  for  a  particular  application.11 

The  electromagnetic  field  of  the  ferrite-loaded  transmitting  loop  is  given  by  Eqs. 
(5-1)  to  (M|  nth  the  moment  at  *  ihmFrieNA-  Tbc  ferrite-loaded  loop,  however, 
it  seldom  used  as  e  transmitting  antenna  because  of  the  problems  associated  with  the 
nonlinearity  and  tbc  dissipation  is  the  ferrite  at  high  magnetic  field  strengths.'1 

8-3  ELECTRICALLY  LARGE  LOOPS _ 

At  the  electrical  sire  of  the  loop  antenna  it  increased,  the  current  distribution  in  the 
loop  departs  from  the  simple  uniform  distribution  of  the  electrically  smell  loop.  For 
single-turn  loops,  this  departure  has  a  significant  effect  on  performance  when  the  cir¬ 
cumference  is  greater  this  about  0.  IX.  For  example,  the  radiation  resistance  of  an 
electrically  small  circular  loop  with  a  uniform  current,  as  predicted  by  £q.  (5-5),  is 
about  86  percent  of  the  actual  resistance  when  fib  ■  2wb/X  m  0.1  and  only  about  2fi 
percent  of  the  actual  resistance  when  fib  m  0.3. 

Of  the  possible  shapes  for  aa  electrically  large  loop  antenna,  the  single-turn  thin- 
srirc  circular  loop  has  received  the  moat  attention,  both  theoretical  and  experimental. 
The  popularity  of  the  circular  loop  is  due  in  part  to  its  straightforward  analysis  hv 
expansion  of  the  current  in  the  loop  u  a  Fourier  series: 
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where  the  ingle  0  is  defined  in  Fig.  5-1. M  Measurements  on  electrically  large  loops 
with  other  shapes,  such  u  the  square  loop,  show  that  their  electrical  performance  is 
quelitativcly  similar  to  that  of  the  circular  loop;  therefore,  only  the  circular  loop  will 
he  discussed  here.'1 

Circular-Loop  Antenna 

The  theoretical  model  for  the  circular-loop  antenna  assumes  a  point-source  generator 
of  voltage  V  at  the  position  d  •  0.  nuking  the  input  impedance  of  the  loop  l  •  R 
+  JX  ■  V/Ht  “  0).  In  practical  applications,  tbc  full-loop  antenna  it  usually  driven 
from  n  balanced  source,  such  as  a  parallel-wire  transmission  line,  end  the  half-loop 
antenna,  the  analog  of  the  electric  monopote,  is  driven  from  a  coaxial  line,  si  in  Fig. 
5-10.  The  point-source  generator  of  the  theoretical  model  contains  no  deuils  of  tbc 
geometry  of  the  fead  point,  and  it  is  not  strictly  equivalent  to  either  of  these  methods 
of  ciciutkm.  However,  theoretical  current  distributions,  input  impedances,  and  field 
patterns  computed  with  the  point-source  generator  and  20  terms  in  the  Fourier  scries 
(Eq.  (5-l9)|  arc  generally  in  good  agreement  with  measured  values.*  Thus,  the  theory 
serves  as  a  useful  design  tool. 

In  Figs.  5-1 1  and  5-1 2,  the  input  impedance  of  ■  loop  constructed  from  a  perfect 
conductor  is  shown  as  a  function  of  the  electrical  size  of  the  loop  fib  m  2» b/k  (cir- 
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no.  a-11  Input  ranetanoa  el  circular-loop  Mmi  wm  Wacutcal 
Hz*  (csrosnrorcnoo/wuveWoqm) . 

cumfcrcncc/wavcleitgth)  for  various  values  of  the  radius  of  the  conductor,  indicated 
by  the  itiriwu  parameter  S  •  2  In  (2v b/a).  These  impedances  are  for  full-loop 
aatcanas;  for  half-loop  antennas  with  the  same  radius  and  conductor  sue.  impedances 
are  approaimalcly  one-half  of  these  values.  The  reactance  X  a  leen  to  he  sera  at  points 
near  fib  -  (aotircsonant  points)  and  fib  -  1,2,3,...  (icsonaat  points). 

The  resistance  obtains  relative  maxima  near  the  points  of  antiresonanoe  and  relative 
minima  near  the  points  of  resonance.  Impedances  computed  from  Eqa  (5-))  and  (5- 
1).  which  apply  to  electrically  small  loops,  are  also  shown  in  FifS.  5-1 1  and  5-12;  the 
inaccuracy  of  these  formulas  with  increasing  fib  k  evident. 

When  the  electrical  size  of  the  loop  is  Dearths!  for  resonanoe  (0b  ”  1,2,1,...), 
the  rtntnsaant  term  in  the  Fourier  series  for  the  curreo:  (Ex).  (5-19))  is  the  one  with  « 
•  integer  (dh)  For  example,  near  the  6rst  resonance  fib  m  I,  the  currant  in  the  loop 
is  approximately  /(♦)  -  2f,  cos  4.  and  the  loop  is  commonly  referred  loss  a  resonant 
loop.  The  resonant  loop  (fib  m  I)  is  the  matt  frequently  used  electrically  large  loop. 
It  has  a  reasonable  input  resistance.  Km  100  D,  for  matching  to  a  transmission  line, 
particularly  when  compared  with  the  resistance  of  the  aatinaoaanl  loop  (fib  m  0.3), 
which  may  be  larger  than  10  kft 
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no.  S-1*  ScftanttHc  e*  amtt  attribution  ki  rtaonaM  loop  (•) 
•ndinti*  «op>oxjm«!*ty  »quh>l«nl  ptir  cX  tpalw  (»). 


IlMOAint  Circular  Loop 

Tbe  current  ■  the  mogul  bop  hat  maxima  at  the  generator,  4  m  0,  tad  at  tbe 
diametrically  opposite  point,  p  -  c.withaodcaatp  *  »/2  and  Jv/2.  Ob  cxaminaiioe 
of  Fip.  5-1).  the  current  b  ten  to  be  roughly  equivalent  lo  that  la  a  pair  of  parallel 
dipole  entenoai  drives  la  phase  and  with  a  spacing  approximately  equal  to  tbe  diam- 
ctcr  of  tbc  loop. 

The  far-toae  ltd  pattens  for  the  rcaonaat  bop  than  ia  Fig.  S-lte-r  arc  algo 
timilar  to  tboee  for  the  pair  of  dipolee;  they  have  little  rraemblaace  to  tbe  figurweight 
patten  of  tbc  electrically  email  bop,  Fig.  5-2.  There  are  ten  eoatpooeotg  to  tbc  elec¬ 
tric  field,  £ « cad  E,  it  zero  ia  tbc  horizontal  platte  t  •  r/2  cad  ia  tbe  vertical 
plane  4  •  0,  e.  while  £,  it  imall  ia  tbc  vertical  plaae  4  •  e/2,  Je/2.  The  amplitude 
patterns  are  *ymBietrical  about  tbe  planet  I  •  e/2  and  p  •  0,  e  owing  to  tbc  r» 
metrical  rymmeuy  of  tbe  loop,  tnd  they  are  aearly  symmetrical  about  tbe  plane  4  - 
r/2,  Jr/2  owing  to  tbe  dominance  of  the  term  2/(  on  p  in  tbe  current  distribution. 
At  tbe  maxima  (#  «•  0,  e)  of  tbe  bidirectional  patten,  tbe  electric  field  is  liaeariy 
polarized  in  the  direction  f . 

To  help  us  visualize  tbe  electric  field,  three-dimensional  emptiludc  patterns  for 
tbc  electrically  small  bop  and  the  resonant  loop  art  presented  in  Fig.  5-1 S.  Each  draw¬ 
ing  is  a  series  of  patterns  on  phots  of  constant  angle  P;  only  the  patterns  in  the  upper 
hemisphere  (0  515  e/2)  arc  shown,  since  Ihoee  in  tbe  lower  hemisphere  art 
identical. 

Tbe  directivity  of  the  circular  loop  in  tbc  direction  *  0  or  e  ia  shows  at  a 
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HO.  HI  Fur-rone  uuctnc  M  pattens  In  upper  hunueptsm. 
(a)  ESctrieuy  tma«  loop.  06  c  1.  <b)  Beeonewt  bcp. d»  »  1.0. 


*14  Typo*  and  DaalpUaMdt 

frtctioa  o#  tit  electrical  *i**  fit  ia  Fig.  S-l  4;  il  i>  tboul  1.4  dB  for  ft  “  1.0  tad  ha* 
*auiauic(ibaNOM(«ft  »  1.4.  The  directivity  it  fairty  independent  of  the 
penmner 0 ford*  $  1.4. 

Tht  mo— H  bop  anicaaa  it  tltradivt  for  practical  application  became  of  itt 
moderate  iapat  man  an*  aad  tymaietrical  held  patter*  with  rcaaooabte  directivity. 


0  to  to 


no.  h-10  DnctMly  ot  ovcular-ioop  entema  to*  t  •  0. 
*  venue  electric*)  alto  (cecmOennc* /woiwtangta). 


no.  *17  Oeecttvny  ot—o— rtoop*m«n *.»»  -  t  o,  toe* 
-  0  venue  dteienoe  from  reiector  d/V  Theoretical  am*  *  tar 
MnA*  planar  re— Otar;  meeaved  porn  an*  tar  equar*  rv—cur. 
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Loop  Veenraa  t-lf 

TV  bidirectional  Mian  of  its  pattern.  however.  ii  nasally  act  desired,  and  a  reflector 
■r  an  array  of  ioope  it  wed  M  Bake  tV  pattern  aadirectiooal. 

Circular  loop  wtth  Manor  IMVctor 

TV  patter*  of  IV  moMat  loop  it  Bide  unidirectional  and  tV  directivity  ia  the  direc¬ 
tion  I  -  Ok  increased  by  placing  IV  loop  over  I  planar  reflector.  TV  theoretical 
Wkltt  for  u  infinite  perfectly  conducting  reflector  (Fig.  S-17)  ibcm  tVt  the  directn- 
ky  it  greater  then  9  dB  for  rpecuigi  between  the  loop  end  the  reflector  to  the  range 
0.0S  £  d/k  £  0.2-'*  Over  tbit  tame  range  of  tpe  rings,  the  inpet  impede  nee  Z  «  JT 
+  JX  (Fig.  '-1I)  hat  reluct  which  arc  ratify  Bitched;  the  reuttaacc  it  reuonibtc  (JT 
£  1 15  Q),  and  tV  reactance  it  email  (|JT|  <,  20  0). 

TV  theoretical  retolu  for  in  infiniw  reflector  uc  in  good  agreement  with  ma¬ 
tured  data  for  finiu  square  rcflecton  of  tide  length  i.  TV  directiritica  matured  by 
Adechi  and  Mushiike1’  (Ftp.  S-17)  for  a  reflector  with  t/k  -  1.2  and  d/k  £  0.26 
are  tHghtly  higher  than  tboic  for  an  infinite  plane,  while  the  input  impedances  ma¬ 
tured  by  Rojarayaooet  and  Seiiguchi"  (Fig.  Sit)  show  rariation  with  reflector  tire. 
0.41  £  t/k  £  0.95,  but  general  agreement  with  tV  retulu  for  an  infinite  plane. 


o  o*  as  as  on  as  o«  or 


«/h 

RO.H*  bend  Impedanoe el CW0UV400P anteram,  g»  •  1.0  rer¬ 
un  CMance  bom  reSactor  dik.  Theoretical  eurvea  are  lor  HMe 
planer  reSector,  measured  pdret  are  lor  equare  rejector. 
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Electric  field  pattern  matured  by  RptarayaaoM  aed  Sekiguchi**  far  reeonaat 
kept  ow^iurur  wavelength,  rf/X  “  0.25.  in  front  of  eqvare  reflectore  in  tbowa  is 
Fi*.  Hf.  The  ehaded  area  la  each  figure  above  the  ranaooa  is  the  patten  that  ie  a 
rath  cf  changing  the  tut  of  the  aquare  reflector  from  r/X  •  O.M.'tj/l  m  0.95. 


f*v/2.Sv/2  0*0.* 


tat  tkl 

no.  a-ia  Matured  tar-iona  Hecate  Md 
panama  tor  loop  vtth  tb  -  1.0  one  aquare 
redactor,  d/x  -  OJS  kmar  one  a/X  -  O  H; 
outer  curve  a/X  -  0.04  (a)  vartlcamiana 
Md  potent  |*.|.  «  •  v/i.  5./J  (ft)  Varo- 
catplane  a*M  pattern  |*,|.  «  -  0.  *.  fMav- 
awad  OMa  *am /Marayanonr  and  &Xv^-etf  y 


Coaxial  Array*  of  Circular  Loop* 

Loop  antennae,  like  linear  antennae,  can  be  combined  in  an  array  to  improve  perfor¬ 
mance.  The  moat  common  array  of  circular  loop!  it  the  coaxial  array  in  which  ail  the 
loopt  arc  parallel  and  have  tbeir  cental  oq  a  common  axie;  an  example  of  a  coaxial 
array  ie  ehown  lata  in  the  meet  of  Ftp.  5-21.  The  Fouricr-ecria  analytic  far  the  ample 
loop  ie  oaeily  extended  to  the  coaxial  array  whoa  all  the  driven  loope  arc  fed  at  a 
common  angle.  e.g.,  #  -  0  in  Ftp.  5-1.  The  current  dietributica  ia  each  loop  ie 
expreeaed  aa  a  ecrica  of  trigonometric  terma  like  that  in  Eg.  (5-19).  The  aimpltdty  of 
the  analytic  retulte  from  the  orthogonality  of  the  trigonometric  lenne  which  make* 
the  coupling  between  loopt  occur  only  fa  terme  of  the  earnc  order  a.  Thue.  if  all  the 
drive*  loopo  in  the  array  arc  near  rteooant  lire,  pb  a  I.  the  term  n  ■  I  it  the  dom¬ 
inant  one  in  the  current  dietributioox  fa  nil  loope;  i.e.,  the  current  «  approximately 
proportional  to  cm  *  ia  all  loope. 

When  all  the  elemcnte  in  the  loop  array  are  driven,  the  tame  procedure!  that  am 
aaed  with  arreyi  of  hnev  elemcnte  can  be  applied  to  eelcct  the  driving-point  voltegca 
to  optimiac  certaio  parameter!,  lucbae  directivity. feed  arrangement  needed  to 
obtain  the  preacribed  driving-point  voluget,  however,  ie  very  complex  for  mom  than 
•  few  element*  in  the  array.  Aa  a  reeult,  a  umpter  and  more  eoonomicaJ  arrangement, 
aa  array  containing  only  ooe  driven  element  and  levtrxl  panaitk  loopa,  ie  often  need 
(a  panuUc  loop  I a  a  coo  turnout  wire  with  no  tenninalt). 

When  a  tingle  c lately  i paced  pnneite  i»  wed  with  e  driven  loop,  the  pantile 
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may  cl  u  a  director  or  at  a  re  Recur.  This  it  illuttrxtcd  in  Fig.  $-20.  m  which  electric 
field  pettems  arc  ihowa  (or  a  drives  loop  (fib,  "  1.0)  and  a  parasitic  loop  with  the 
spacing  d/A  •  0.1.  For  loops  of  the  same  electrical  size  {fib,  •  fib ,  ■  1.0),  tbemaa- 
ima  is  the  patters  at#  *  0.  *  arc  nearly  equal.  The  parasitic  loop  that  is  slightly 
smaller  than  the  drives  loop  (fib,  ■  0.95)  acts  as  a  director,  producing  a  maximum 
is  the  pattern  Kl  •  r,  while  the  parasitic  loop  that  is  slightly  larger  thaa  the  driven 
loop  (fib,  ■  1.05)  acts  as  a  reflector,  producing  a  maximum  in  the  pattern  at  I  ■  0. 
This  behavior  is  very  similar  to  that  observed  (or  a  resonant  linear  antenna  with  a 
closely  spaced  parasite. 

The  driven  loop  of  electrical  size  fib,  -  1.2  (Dj  •  1 1,  s,  ■  a,)  with  a  single 
parasite  was  studied  in  detail  by  Ito  ct  at."  In  that  study,  tbc  optimum  director  was 
determined  to  be  a  loop  with  fib,  »  0.95  and  spacing  d/A  »  O  ld.  this  produced  a 
directivity  of  about  7  d  Bald  m  tr.  The  optimum  reflector  was  a  loop  with  fib,  »  1.08 
and  a  spacing  d/A  nr  0.15;  this  produced  a  directivity  of  about  8  dB  at  (»  0.  Note 
that,  for  this  case,  the  optimum  director  and  the  optimum  reflector  arc  both  smaller 
than  the  drives  loop. 

A  Yagi-Uda  array  of  loops  with  a  tingle  reflector  (element  I).  an  exciter  (the 
driven  element  2),  end  seven)  directors  of  equal  size  fib  and  equal  spacing  d/A  is 
thaw*  m  the  inset  of  Fig.  5-21*  As  is  its  counterpart  with  linear  dements,  in  the 
Yngi-Uda  amy  of  loops  the  re  fleet  or -exciter  combination  acts  is  a  feed  for  a  slow 
wave  that  propagates  along  tbc  amy  cd  directors.”  The  lowest-ordcr  propagating 
wave  (mode)  exists  for  directors  la  than  about  resonant  size  (fib  <  1.0)  with  (pac¬ 
ings  leas  than  about  a  half  wavelength  (d/A  £  0.5).°  An  amy  supporting  this  mode 
has  aa  cad-fire  patten  with  a  linearly  polarized  electric  field  at  the  maximum.  I 
-  0. 

The  procedure  foe  designing  a  Yagi-Uda  amy  c t  loops  is  the  seme  at  for  an 
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PM.  Ml  DtrectMty  of  YagKMa  my  of  render  loop  vMnru  tar  «  •  0 
number  of  dkoctora.  mrectarepnelng  dix  -  0.2 


array  with  line*'  elements  *  Tbc  notated  reflector-exciter  combination  d  asuatly  c  Ho¬ 
ren  to  have  maximum  directivity  in  tbc  direction  f  -  0.  For  example,  tbc  optimized 
twoelftnent  array  described  above  might  be  Med.  The  number,  sue.  and  > pacing  of 
the  direct  on  arc  then  adjusted  to  obtain  tbc  desired  performs  not.  such  as  a  specified 
end-fire  directivity.  Tbc  maximum  end- fire  directivity  is  determined  by  the  electrical 
length  of  tbc  array  L/X  (L  is  tbc  distance  front  tbc  exciter  to  tbc  last  director).  Tbc 
larger  the  number  of  directors  within  the  length  L  tbc  smaller  tbc  electrical  size  of 
the  directors  vrili  be  for  maximum  directivity,  typically  0.1  S  fib  S  1.0. 

As  an  example,  the  directivity  of  a  YagMJda  army  of  loop*  with  the  director 
spacing  d/X  -  0.2  is  shown  as  a  function  of  tbc  number  of  directors  or  the  length  oi 
the  array  L/X  in  Fig.  5-21.  Two  theoretical  curves  and  two  sets  of  measured  data  are 
shown.  All  the  results  agree  to  within  about  I  dB,  even  t bough  they  are  lot  different 
reflector-exciter  combinations  and  slightly  different  director  sizes.* 


Iha  aarawsun—td  hatha  dUffa  ie’maigitsr*  art:  Apest-Hassia.  Mi  -Ms*  MO.  ard- 
mimd  fir  ths  Uotatad  isdactar-sscitar.  sad  gS  asximasd  far  aach  hagth  tfk  Tata's  sad  Satioacfc.M,  * 
l-OXMi  -  IJAAi/X  ■  OH. aad  Manlauaad tm  aach  la^th  L/K  tSnausawS  saa  SSafai (IVTS). S,S 
•  lOi.th  -  1-IO.du/S  -  S  l.iadSa  »  Of  SwsjtU^lto  i/X 
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For  otttila  appttealieae,  k  k  dcairabk  to  poakioa  the  terminal!  of  tke  loop  oouano 
pradaely  oo  to  to  predate  geometrical  rymmetry  far  the  loop  and  Hi  eoaaectioaa  about 
•  pltao  pcrpoadicalar  to  Ike  loop.  Thu  tea  often  be  accost  pluhed  by  ueing  the  ». 
caked  tbielded  loop;  Fig.  S-22ek  an  example  of  a  akielded  receiving  loop  wboacextcr- 
aal  aerface  k  ayaunetrical  about  ike  yt  pUne.** 

With  reference  to  Fig.  5-22  a,  the  tkickaeei  of  Ike  metal  fanning  tke  ikield  k 
ekeecn  to  be  aceeral  ikin  deptbe;  tkk  prevcnti  any  direct  inunction  between  the  cur- 
mu  on  tke  iatenal  and  tke  external  lurfaece  of  tbe  ibicld.  The  effective  terminal!  of 
tke  loop  antenna  arc  at  the  cade  of  tke  mall  gap  AM.  Tbe  inner  conductor  aad  tke 


(0)  let 

FtO.  *-»  gNoMod  loop  antenna  <e)  wICtoqutvalateantoma  lb)  one 
oqdmlonl  OananOaolon  too  (c). 


MO  typMMOMfcnlMhodi 

shMd  form  a  coaxial  trsasmiwioa  has  of  length  *  •+  r 6  coaaectiag  the  gap  with  the 
load  Impedance  Z$.  Tku,  the  effect he  load  impedance  Zt  at  the  gap  ■>  Zt  trans¬ 
formed  by  the  length  of  unwiiHoa  Hot  k  +  it. 

The  iwhin  inUMl  la  Fig.  Vila  *  cully  analyzed  by  eosuidcriag  the  loop. 
Fig.  5-226,  tad  tkc  treaaouaaioa  line.  Fig.  5-2  2  r,  separately.  Tbc  inctdcai  field  pro- 
dnoea  t  current  oa  tkc  external  snrfsce  of  the  shield;  tkc  eumM  pence  through  tbc 
effective  impede noe  Z*.  producing  the  voltaic  Vu  which  he  ee  electrically  imeU  loop 
cat  he  determined  from  Eqs.  (5-11)  tad  (5-13).  Tbie  voiugt  it  transmitted  over  tbc 
coaxial  tiae  to  become  V,  at  the  load  inpedaace  Z». 

Other  euaipiee  of  the  shielded  loop  art  shown  ia  Fi|.  5-23.  A  balanced  version 
of  the  loop  ia  Fig.  5-22a  ia  ia  Fig.  5-2) a.  aad  a  method  lor  feediag  a  loop  ia  front  o( 
a  plaaar  reflector  is  ia  Fig.  5-2)*. 


To  illumu  a  typical  uc  of  the  shielded  loop,  consider  the  electrically  small 
receiving  loop  placed  ia  an  incident  electromagnetic  plane  wave  with  the  wave  vector 
lk»is  Fig.  5-24.  This  is  the  tame  geometry  u  in  Fig.  5-5.  escape  that  the  terminals 
of  the  loop  are  at  the  angle  p  w  Pt  instead  of  p  »  0,  and  p,  -  ».  i,  »  0.  The  loop 
far  thin  tumple  might  be  an  antenna  in  a  direction  finder  with  the  direction  of  the 
laddc-it  wave  to  he  determined  by  placing  a  anil  of  tbc  field  pattens  ia  the  direction 
Cfke 

Tbc  voltaic  at  tbc  open-circuited  terminals  of  the  electrically  email  loop,  deter¬ 
mined  from  the  Fourier-ecrics  analysis,  ia  approximately 

Vac  -  I,  -  Jtfb  oos  pj  (5-20) 

For  many  applications,  the  second  term  in  Eq.  (5-20)  1s  negligible,  since  06  «  I  for 
on  electrically  until  loop',  in  this  event,  Eq.  (5-20)  reduces  to  Eq.  (5>l  I)  with  N  •  I. 
Pi  *  0.  Ia  other  applications,  however,  this  term  msy  represent  a  significant  cootri- 
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tntion  to  Ike  response.  Far  example,  the  sensitivity  of  the  utiui  hi  the  direction 
finder  it  decreeied  by  thie  term  because  it  fills  in  the  nulls  of  the  sin  I,  field  pattern 
(for  fib  m  0.1.  dt  ■  0.  the  minim*  in  the  pettera  are  only  14  dft  below  the  mtxima). 

The  second  term  in  Eq.  (5-TO)  can  be  made  insiinificant  by  reducing  the  elec* 
trie*)  sis*  of  the  loop  fib-,  however,  this  will  also  decrease  the  sensitivity,  sieoe  the  area 
of  the  loop  is  decreased.  An  alternative  is  to  make  this  term  sera  by  pUaag  the  ter¬ 
minals  of  the  loop  precisely  at  -  ±e/2(oosdt  -  0*.  this  can  be  accomplished  by 
using  a  shielded  loop  as  in  Fig.  J-22o  or  Fig.  $-23*. 


5-5  ADDITIONAL  TOPICS 


The  brevity  of  this  review  requires  omission  of  many  interesting  topics  concerning  loop 
antennas,  la  recent  years,  there  hat  been  considerable  study  of  loop  antennas  in  dose 
proaimity  to  or  embedded  in  metci  itl  media  such  as  the  ocean,  the  earth,  or  a  plasma. 
The  electrical  characteristics  of  loops  in  these  instances  can  be  quite  different  from 
shoe*  of  loops  in  unbounded  free  space,  as  described  in  tbit  review.  The  mayor  appli¬ 
cations  of  thie  wc*k  arc  in  tbc  areas  of  subsurface  oonununicatioa  and  dctecLoa  (geo¬ 
physical  prospecting) 

Tbc  loop  antenna  near  a  planar  interface  separating  two  semi-infinite  material 
regions,  such  as  the  air  and  the  earth,  baa  been  investigated  citctnivcly.  When  the 
bop  i*  electrically  totall.  it  can  be  approximated  by  *a  etemeatary  magnetic  dipole, 
and  the  electromagnetic  field  away  from  the  bop  can  be  determined  from  the  classical 
analysts  of  Sommerfcld.u  If  tbc  field  Bear  tbc  electrically  small  bop  it  required,  the 
approximation  by  a  magnetic  dipole  may  no  longer  be  adequate,  and  a  bop  with  a 
finite  radius  and  a  uniform  current  mast  be  considered1*  For  the  electrically  Irrge 
bop  near  a  planar  interface,  an  analysis  that  allows  a  nonuniform  current  in  the  bop, 
such  as  the  Fouricrwerict  analysis  for  the  circular  loop.1’  mutt  he  used. 

Tbc  performance  of  a  bop  embedded  in  a  material  caa  be  eherad  significantly 
by  placing  tbc  bop  b  a  dielectric  eerily,  (tick  as  •  sphere,  to  form  en  Inmbtad  bop. 
11m  electrical  size  end  shape  of  the  insulating  cavity  and  tbc  location  of  tbc  bop  ■ 
the  cavity  tan  be  used  to  control  the  electromagnetic  field  and  input  impudence af  tbc 
antenna. 
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CURRENT  ANTENNA  NEAR-FIELD  MEASUREMENT  RESEARCH  AT  THE  GEORGIA 
MSTITUTE  OF  TECHNOLOGY 


Virginia  V.  3ory»,  Edward  B.  3oy*,  w.  Marshall  Leacn,  3r.* 

ABSTRACT 

Current  progress  in  antenna  near-field  measurement  techniques  is 
presented.  Emphasis  is  given  to  probe  compensation  lor  the  spherical 
measurement  system  and  to  correction  for  probe  position  error  in  the  planar 
measurement  system.  Also  discussed  are  the  recent  design,  implementation  and 
validation  of  a  cylindrical  near-field  range. 

INTRODUCTION 

The  Georgia  Institute  of  Technology  has  been  active  in  the  research  and 
development  of  near-field  measurement  techniques  for  over  a  decade  and  has 
had  an  operational  planar  near-field  scanner  since  1968.  Recently  a  cylindrical 
near-field  range  has  been  designed,  implemented,  and  validated.  The 
implementation  of  a  spherical  near-field  range  currently  is  under  way. 

Since  each  of  the  planar,  cylindrical  and  spherical  near-field  measurement 
systems  has  advantages  for  particular  applications,  research  continues  in  each 
coordinate  frame.-  The  planar  measurement  system,  for  example,  is  suitable  for 
directive  antennas  with  pencil  beam  radiation  patterns,  while  the  cylindrical 
measurement  surface  is  appropriate  for  fan  beam  antennas  with  narrow  beams 
in  the  vertical  direction.  The  spherical  measurement  system,  which  enjoys  the 
distinct  advantage  of  providing  a  complete  measurement  surface,  is  suitable  for 
omni  directional  antennas.In  each  case  the  measurements  are  carried  out  in  the 
near-field  of  the  antenna  under  test}  thus,  unless  an  ideal  probe  is  used,  the 
effects  of  the  probe  antenna  need  to  be  taken  into  account. 

A  method  of  computation  of  far-field  radiation  patterns  from  probe 
compensated  near-field  measurements  may  be  described  briefly  as  follows.  In 
each  coordinate  system,  the  fields  radiated  by  the  antenna  under  test  and  the 
probe  antenna  are  expressed  in  terms  of  the  appropriate  (plane,  cylindrical, 
spherical)  wave  expansion.  The  Lorentz  reciprocity  theorem  is  then  utilized  to 
obtain  coupling  equations  between  the  unknown  fields  radiated  by  the  test 
antenna  and  the  known  fields  of  the  probe.  The  unknown  mode  amplitudes  are 
determined  from  the  coupling  equations,  and  finally  the  far-field  patterns  of  the 
test  antenna  are  calculated  from  the  mode  amplitudes  (1 ,  2) 


When  a  portion  o  1  Georgia  Tech's  Engineering  Experiment  Station  was 
relocated  to  a  site  fifteen  miles  from  the  main  campus,  continued  use  of  the 
centrally  located  near-field  range  became  impractical.  An  automated  near-field 
measurement  system  was  designed  employing  cylindrical  geometry.  Automatic, 
digital  recording  of  the  near-field  data  is  accomplished  using  a  microprocessor 
based  controller  which  handles  the  stepping  and  scanning  of  the  antenna  end 
probe  positioners,  and  by  using  a  microcomputer  with  parallel  interface,  to  read 

♦Engineering  Experiment  Station,  M School  of  Electrical  Engineering,  Georgia 
Institute  of  Technology,  Atlanta,  Georgia  30332,  U.S.A. 


tile  BCD  outputs  of  the  amplitude,  phase,  and  synchro  displays  at  the  desired 
sample  points  as  Indicated  by  the  positioner  programmer.  The  measured  near* 
field  data  may  be  stored  on  the  microcomputer's  floppy  disk  and  later  transferred 
to  a  minicomputer  via  a  serial  data  line  after  the  measurements  are  complete.  It 
Is  also  possible  to  send  the  data  directly  to  the  minicomputer  as  the 
measurements  are  being  made. 

Computer  programs  based  on  the  analysis  outlined  in  [  3  J  have  been 
written  in  FORTRAN  to  convert  cylindrical  near-field  measurements  to  far-lield 
patterns.  Experiments  have  been  run  on  two  array  antennas  as  part  of  the 
validation  process  with  comparisons  made  between  computed  and  measured  far- 
field  patterns.  The  effects  of  probe  compensation  and  measurement  surface 
truncation  have  been  studied.  Presently  a  cylindrical  near-field  measurement 
program  which  will  provide  relative  far-fieid  phase  data  as  well  as  amplitude  data 
is  being  developed. 

PROBE  COMPENSATION  FOR  THE  SPHERICAL  MEASUREMENT  SYSTEM 

The  implementation  of  a  spherical  near-field  range  on  the  main  campus  is 
being  undertaken.  However,  the  probe  correction  for  the  spherical  scanning 
geometry  is  difficult.  Not  only  is  the  mathematical  basis  complicated,  but  also 
numerical  implementation  is  neither  straightforward  nor  efficient.  An  alternate 
derivation  of  the  spherical  surface  near-field  coupling  equation  is  being  studied 
in  an  effort  to  simplify  these  problems. 

The  spherical  near-field  measurement  surface  is  extremely  attractive  for 
two  reasons.  First,  the  measurements  can  be  performed  using  conventional  far- 
field  antenna  positioners.  Second,  the  calculated  patterns  cover  up  to 
steradians  rather  than  some  limited  angular  region  of  space.  The  coupling 
equation  between  a  test  antenna  and  a  measuring  probe  for  the  spherical 
geometry  was  first  derived  ini  <(I*  Simplifications  to  this  theory  for  single  mode 
probes  and  efficient  numerical  algorithms  for  data  reduction  were  developed  in 
[5>  The  single  mode  probe  assumption  greatly  simplifies  the  probe  correction  by 
eliminating  all  terms  but  one  in  the  double  summation  of  spherical  wave 
harmonics  representing  the  principal  polarization  component  of  the  probe 
response  and  all  terms  representing  its  cross-polarization  component  from  the 
coupling  equation.  This  essentially  removes  the  need  for  probe  correction 
because  the  probe  measures  only  a  single  component  of  the  near-field  of  the  test 
antenna. 

Single  mode  probes  can  be  very  difficult  to  fabricate,  for  they  must  have 
high  polarization  ratios.  In  addition,  they  can  have  a  very  low  gain,  e.g.  in  cases 
where  the  probe  is  designed  to  respond  to  a  radial  near-field  component.  The 
frequency  sensitivity  of  these  probes  is  also  a  problem.  Small  changes  in 
frequency  can  cause  them  to  no  longer  act  as  a  single  mode  probe.  Thus  it  may 
be  necessary  to  design  the  probe  for  each  specific  test  frequency. 

The  problems  associated  with  single  mode  probe  fabrication  can  be 
circurr-  ented  by  neglecting  the  probe  compensation  problem  altogether. 
Compared  to  the  planar  and  cylindrical  surfaces,  the  spherical  scanning  surface  is 
the  least  sensitive  to  probe  correction,  and  many  have  reported  acceptable  results 
by  neglecting  it.  However,  a  simplified  probe  correction  scheme  which  does  not 
require  a  single  mode  probe  is  desirable.  This  has  been  a  recent  topic  of  research 
at  Georgia  Tech.  Several  approaches  to  the  problem  have  been  investigated.  The 
most  promising  solution  has  shown  that  the  polarization  ratio  of  the  measuring 
probe,  not  its  directional  characteristics,  Is  the  majo*  factor  in  probe  correction  for 
the  spherical  surface.  Although  this  work  is  not  yet  complete,  it  appears  that  it 
will  be  possible  to  calculate  tne  two  principal  plane  far-field  pattern  components 
of  the  field  radiated  by  the  test  antenna  while  correcting  for  the  polarization 


ratio  of  the  probe.  A  significant  anticipated  result  is  that  these  pattern 
components  can  be  calculated  as  a  summation  oi  scalar  tcsseral  harmonics  rather 
than  vector  harmonics  over  the  far-field  angular  coordinates.  This  research 
would  be  completed  within  the  next  six  months. 

SIMPLIFIED  PRCBE  POSITION  ERROR  COMPENSATION 

A  simpiified  probe  position  error  compensation  algorithm  recently  has 
been  developed  for  planar  near-field  measurement  systems.  The  technique 
compensates  for  known  probe  position  errors  as  large  as  two  wavelegths  in  the  x, 
y  and  z  positions.  This  simpiifiea  technique  was  an  outgrowth  of  the  rigorous 
theory  developed  by  Corey  and  Joy  at  Georgia  Tech  in  1980(6).  Current  research 
is  directed  toward  applying  the  simplified  probe  position  error  compensation 
technique  to  spherical  surface  near-field  measurement  systems. 

APPROXIMATE  TECHNIQUES  FOR  NEAR-FIELD  PROBE  POSITION  ERROR 
COMPENSATION 

Error  analyses  conducted  at  Georgia  Tech  [7,81  the  National  Bureau  of 
Standards  ( 9,10)  and  the  European  Space  Agency  (11,125  have  shown  that  the 
positioning  accuracy  requirement  for  planar,  cylindrical  and  spherical  near-field 
measurement  systems  is  on  the  order  of  one  hundredth  of  a  wavelength.  This 
requirement  severely  limits  the  upper  frequency  usage  of  the  near-field 
measurement  technique.  One  approach  to  overcoming  this  limitation  is  to 
compensate  for  the  position  error  by  including  the  position  of  each  measurement 
in  the  near-field  to  far-field  computation  process.  This  requires  two  additions  to 
a  near-field  measurement  system.  First,  a  position  indicating  system  must  be 
added  to  the  near-field  apparatus  to  accurately  determine  the  position  of  the 
near-field  probe  or  the  antenna  under  test,  or  both,  depending  on  the  type  of 
near-field  system  being  used.  An  example  of  a  very  high  accuracy  position 
indicating  system  is  the  commercially  available  interferometer  system  which 
measures  distance  to  within  one  quarter  of  a  wavelength  of  its  operating  light. 
Second,  position  error  compensation  software  must  be  added  to  utilize  the 
position  error  information  in  the  far-fieJo  pattern  computation  sequence.  This 
paper  addresses  this  second  aspect  of  position  error  compensation  for  planar 
near- field  measurement  systems.  Specifically  a  class  of  approximate  teenniques 
has  been  developed  which  is  computationally  efficient  as  compared  to  the  earlier 
technique  of  Corey  and  Joy  ( 13) 

THE  K-CORRECTION  TECHNIQUE 

If  the  assumption  is  made  that  all  near-field  energy  is  propagating  in  the 
direction  of  the  main  beam  of  the  antenna  under  test,  k^B  »  then  the  fields 
existing  at  equally  spaced  points  on  the  ideal  measurement  plane  (xj,  yj,  0)  can  be 
easily  calculated  from  the  measurements  taken  at  the  points  (xq,  yp,  z)<),  located 
on  a  nearby  surface.  The  relationship  between  the  field  patterns  at  these  two 
points  is  given  in  terms  of  the  familiar  plane  wave  propagation  equation  as 

E(x,,  yj,  0)  =  £(xk,  yv,  zk)  exp(-j  RMB  *  r«) 

where 

Hi  ■*  H  -  He  *  ^xi  "  *k)  ax  ♦  (Yi  *  Yk)  ay  ♦  (0  -  Z^)  az 

The  field  values  with  the  location  rj_closest  to  the  line  passing  througn  the  the 
desired  field  value  location  rj  in  the  k^D  direction  is  used  to  determine  the  field 
value  at  the  desired  location. 


This  technique  has  been  applied  to  the  following  example.  Figure  1 
shows  a  normalized  out-of-plane  probe  position  error  for  a  planar  near-field 


measurement  system.  The  maximum  value  of  this  error  is  shown  as  1.0.  Effects  of 
varying  this  maximum  value  of  probe  position  error  on  the  computed  far-field 
patterns,  with  and  withou*  position  error  compensation  will  be  shown.  Figure  2 
shows  the  lar-field  pattern  of  the  antenna  under  test  computed  from  near-field 
measurements  with  no  probe  position  error.  Figures  3  and  4  show  the  far-field 
patterns  of  the  antenna-under-test  calculated  from  near-field  measurements 
made  on  the  measurement  surlace  of  Figure  1  with  maximum  position  error  of 
0.1  wavelength  and  0.3  wavelength  respectively.  These  figures  show  rapid 
deterioration  of  the  far-field  pattern  accuracy  with  increasing  position  error.  In 
these  calculations  position  error  compensation  was  not  used.  Complete 
restoration  of  these  patterns  (down  tc  a  level  of  -40  dB)  is  possible  for  the  case  of 
2.0  wavelength  or  less  maximum  near-field  probe  position  error  by  using  the  K- 
correction  technique  described  above.  The  accuracy  of  this  calculated  pattern  is 
seen  to  be  excellent  for  this  hypothetical  case,  where  k^B  *s  assumed  to  be 
known  accurately.  Computation  time  for  this  64  by  64  point  near-field  data  set  is 
less  than  five  seconds  (CDC  Cyber  74). 


The  K -correction  technique  for  position  error  compensation,  _  which 
assumes  the  near-field  being  measured  propagates  in  the  single  direction  k^g  is 
now  extended.  Let  it  be  assumed  that  the  near-field  contains  components 
propagating  in  N  directions  simultaneously.  Let  these  directions  be  kn,  1  in  1  N. 
The  choice  of  the  N  directions  of  propagation,  in  practice,  would  be  the  directions 
of  propagation  of  the  highest  far-field  pattern  levels.  The  Nyquist  sampling 
theorem  applied  to  high  gain  antennas  shows  that  the  main  beam  region  of  the 
far-field  pattern  can  be  represented  by  4  to  S  plane  waves  depending  on  the 
aperture  efficiency  of  the  antenna.  The  specification  of  from  4  to  S  plane  waves 
spread  equally  throughout  the  main  beam  region  would  be  a  good  choice  for  the 
N  directions.  Note  that  the  exact  direction  of  propagation  of  the  main  beam  is 
not  required.  A  contiguous  group  of  N  near-field  measurements,  usually  within  a 
circular  or  square  region  of  the  measurement  plane,  is  represented  as  a 
summation  of  N_  plane  waves,  each  propagating  in  one  of  the  specified  N,  kn 
directions.  Let  E(rj<),  1  <^k  <  N  be  the  set  of  N  measured  values  located  at  the  N 
points  rj<,  1  <  k  <_  N.  These  field  values  may  be  written  in  terms  of  N  plane  waves 
with  unknown  complex  amplitudes  A  (R"n)t  1«  n  <  N  as 

N 

£C.7i >  =E  A(Rn)  exp(-j  kn  •  rj<),  1  Ikf  N 

n*l 

.  'is  set  of  N  simultaneous  equations  may  be  solved  for  the  unknown  plane  wave 
'•'tudes  A  0rn),  1 1  n  1  N  and  then  used  to  calculate  field  values  elsewhere  and 
ailarly  at  the  desired  equally  spaced  point  on  a  nearby  planar  surface  f]  as 

-  -  N  _ 

E(rj)  A  Ckn) exp(-j fcn  •  rj) 
n=I 

.tis  latter  evaluation  is  normally  cnly  carried  out  for  a  few  points  near  the  center 
of  the  N  point  group.  A  new  N  point  group  is  then  selected,  usually  overlapping 
the  previous  group,  and  the  above  process  repeated  until  all  equally  spaced 
planar  field  points  are  determined.  The  extended  K-correction  technique  with 
N*9  where  all  9  plane  wave  directions  are  within  the  main  beam  region  has 
shown  to  give  excellent  results  when  the  main  beam  direction  is  known  to  within 
one  main  beam  beamwidth. 

Work  at  Geogia  Tech  continues  to  fully  explore  »he  limitations  of  the 
extended  K-correction  position  error  compensation  technique  and  to  apply 
similar  techniques  to  the  cylindrical  and  spherical  near-lield  measurement 
systems. 
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FICURE  3.  UNCOMPENSATED  FAR-FIELD 
PRINCIPAL  PLANE  PATTERN 
0.1  WAVELENGTH  PROBE  POSITION  ERROR 


FIGURE  4.  UNCOMPENSATED  FAR-FIELD 
PRINCIPAL  PLANE  PATTERN 
0.5  WAVELENGTH  PROBE  POSITION  ERROR 
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Introduction 

This  paper  reporta  on  reaearch  being  conducted  at  Georgia  Tech  on  the 
apherical  aurface  near-field  aeaaureaent  technique.  The  popularity  of  the 
apherical  aurface  near-field  aeaaureaent  technique  la  indicated  in  the  Hat  of 
near-field  rangea  aa  ahown  in  Table  I.  Thla  popularity  la,  in  large  part,  due 
to  the  availability  of  the  Scientific  Atlanta  Spherical  Mear-Field  Antenna 
Analyser.  Specifically,  the  paper  reporta  on  the  atatua  of  (1)  the  Georgia 
Tech  apherical  aurface  near-field  range,  (2)  comparison  of  non -probe- 
compensated  apherical  surface  near-field  to  far-fleld  transformation 
techniques,  (3)  a  probe  position  error  compensation  technique  for  apherical 
aurface  measurements,  and  (4)  alternative  apherical  aurface  near-field  to  far- 
fleld  transformations  which  include  probe  compensation. 

Georgia  Tech  Spherical  Surface  Eear-Fleld/Par-Field  Range 

Figure  1,  shows  a  pictoral  diagram  of  the  Georgia  Ted)  spherical  aurface 
near-field/far-field  antenna  measurement  range  located  in  the  School  of  Elec¬ 
trical  Engineering.  The  range  is  located  in  a  tapered  anecholc  chamber  of 
dimensions  of  40*  long  by  24*  wide  by  IS'  high.  The  instrumentation  and 

•This  work  is  being  supported  by  the  Joint  Services  Electronics  Program. 
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absorbing  material  wo  designed  Cor  single  band  operation  (too  0  to  12  CBa. 
Tba  facility  la  naad  both  Cor  taaaareb  and  instruction  In  antanna  measure¬ 
ments,  antennas,  and  radcoaa.  tba  range  la  now  Cully  functional  In  tba  manual 
aoda  and  vlll  aoon  bo  completely  IntarCaead  to  tba  microoova  eenputar  for 
automated  amplitude,  phase,  and  polarisation  data  collection.  Hear- field  to 
far-f leld  tranaCoraation  la  carried  out  on  other  campus  computers. 

Comparison  of  bao-Probe-Compensated  Spbarlcal  durfeoe  ■ear-Pield  to  rar-Pield 
Transformation  fedmlpms 

Comparison  of  two  techniques  for  non -probe -compensated  spherical  surface 
near-field  to  far-f leld  transformation  ara  being  covtducted.  Tba  two  tech¬ 
niques  ara  (1)  the  spherical  node  expansion  technique,  end  (2)  the  'rector- 
diffraction  surface-integration  technique. 

The  spherical  mode  expansion  technique  hes  become  the  standard  technique 
for  spherical  surface  near-field  to  fer-field  transformations.  The  advantages 
of  this  technique  are  (1)  only  the  tangential  components  of  the  electric  field 
need  be  measured  on  the  spherical  near-field  surface,  end  (2)  the  spherical 
mode  expansion  formulation  allows  for  the  natural  incorporation  of  probe 
compensation.  The  disadvantages  of  the  spherical  mode  expansion  technique 
are  a  direct  result  of  the  mathematical  complexity  of  the  spherical  mode 
descriptions.  The  disadvantages  are  (1)  lengthy  computation  time,  (2)  lengthy 
computer  algorithms,  and  (3)  difficulty  of  probe  position  error  oos^enaatlon. 

The  vector  diffraction  surface-integration  technique  is  a  very  old  tech¬ 
nique  which  la  one  of  the  standard  techniques  uaed  to  calculate  far-f  leld 
antenna  patterns  from  antenna  aperture  fields  and  from  the  fields  on  the  outer 
surface  of  radomea.  The  far-fleld  patterns  are  calculated  by  an  Integration 


of  the  ipwlfltd  on  i  surface  which  omIomi  Mm  entenna  nndor 
toot*  Mertwittlyi  both  tho  tangential  oXoetrle  and  magnetic  fialda  on  tha 
anelooing  surface  mat  bo  measured.  X(  tho  free  space  eolation  batwaan  tha 
alaotrlo  and  magnetic  fields  on  tha  anelooing  surface  la  made*  tha  resulting 
near- field  to  far-f laid  transformation  la  simple.  The  advantages  of  the 
vector  diffraction  surf ace- Integration  technique  are  (1)  mathematically  and 
algorithmically  simple,  and  (2)  easy  compensation  for  probe  position  error. 
The  disadvantages  are  (1)  assumption  of  free  apace  relationship  between  the 
electric  and  magnetic  fields,  and  (2)  difficulty  of  incorporating  probe 
response. 

Probe  Position  Error  Compensation  Technique  for  Spherical  Surface  Kear-Pield 
Measurements 

Several  theoretical,  computer  simulation  and  empirical  studies  have  been 
conducted  to  determine  the  requirements  tor  position  accuracy  for  near-field 
measurements.  The  most  stringent  positioning  requirements  for  the  spherical 
surface  systems  are  approximately  one  two-hundredth  (1/200)  of  a  wavelength 
for  the  radius  and  offset  between  the  two  axes  of  rotation.  Thus  position 
errors  should  be  leas  than  ^.0C6*  (+.015  cm)  at  10  Or. 

experience  gained  with  probe  position  error  compensation  for  planar 
surface  near-field  measurements  la  being  applied  to  spherical  surface  near- 
field  measurements.  The  vector  diffraction  surface-integration  technique 
easily  accommodates  positional  errorr  as  the  measurement  surfaoe  can  take  on 
any  shape.  The  assumed  spherical  surface  la  dlstored  to  coincide  with  the 
actual  surface  over  which  measurements  are  preformed.  The  surface  used  in  the 
surface-integration  la,  therefore,  the  actual  surface  and  position  error  la 


twBwd.  The  spherical  aodt  expansion  technique  cinoot  Hilly  Mowwditi  a 
•onybitteil  HiiUMint  surface.  An  approximate  probe  position  error  tech¬ 
nique,  called  the  R-correction  technique,  hee  been  developed  in  which  ell 
near-field  energy  is  aesunsd  to  propagate  in  the  redial  direction  for  the 
purpose  of  probe  position  error  compensation.  Each  near-field  sample  is 
aesunsd  to  represent  the  amplitude  end  phase  of  a  plane  wave  propagating  in 
the  local  radial  direction.  Ideally  positioned  a  ample a,  those  at  integer 
nultiples  of  the  two  angular  sanple  increments  and  at  constant  radius,  are 
determined  from  the  measured  sample  by  evaluating  the  plane  wave  at  the  ideal 
position.  This  R-correction  technique  is,  thus,  a  phase  only  correction,  but 
has  proven  highly  successful  in  our  initial  investigations.  The  R-correction 
technique  can  be  upgraded,  just  as  in  the  planar  surface  ease,  to  remove  the 
assumption  of  radial  only  propagation.  The  modified  R-correctlon  technique 
uses  several  neighboring  near-field  measurements  and  their  true  positions  to 
estimate  the  direction  of  propagation  of  tha  local  field  and  uses  the  esti¬ 
mated  direction  rather  than  the  assumed  radial  direction  for  probe  position 
compensation. 

Both  techniques  tor  probe  position  error  compensation  rely  on  accurate 
probe  position  data  for  each  near-field  measurement.  Normally,  this  position 
information  is  obtained  from  an  auxilllsry  position  measurement  aystem. 

Probe  rompansated  Spherical  Surface  Near -Field  to  Par-Field  Transformation 

two  techniques  for  the  probe  response  compensation  for  spherical  surface 
near-field  to  tar-field  transformation  are  being  developed  at  Georgia  Tech. 
The  first  technique  is  a  rigorous  rederlvatlon  of  spherical  near-field 
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measurement  coupling  wins  bull  functions  other  thm  the  spherical  note 
•sponsion.  This  work  Is  being  oenduoted  by  V.  M.  Leech.  Jr.  The  second  tech¬ 
nique  is  an  approximate  technique  to  be  used  in  the  sector  diffraction 
surface- integration  technique. 

The  offset  of  the  near-field  probe  directional  response  le  represented, 
as  in  planar  aurface  near-field  measurements,  as  the  convolution  of  the 
probe's  planar  aperture  field  and  the  local  near-field  neasurenent  surface 
field.  It  is  found  that  compensation  for  the  near-field  probe  response  is 
dependent  on  the  ratio  of  spatial  area  represented  by  each  near-field 
measurement  sample  and  the  spatial  area  of  the  near-field  probe. 

This  rstic<  becomes  large  when  the  radius  of  the  spherical  near-field 
measurement  surface  becomes  significantly  larger  than  the  radius  of  the 
antenna  under  test  sperture.  This  ratio  would  be  approximately  25  for  a 
measurement  radius  10  times  larger  than  the  radius  of  the  antenna  under  test, 
and  a  one  wavelength  square  near-field  probe.  For  such  large  ratios,  the 
effects  of  the  probe  directional  response  become  insignificant.  The  ratio  is 
minimum  when  measurements  are  performed  at  minimum  distances.  This  minimum 
ratio  would  be  approximately  one-fourth  (1/4)  using  a  one  wavelength  square 
probe.  For  such  small  ratios,  the  convolution  of  the  probe  spatial  response 
and  near-field  measurements  in  the  sampling  area  is  significant  and  Included 
in  the  far-fleld  computation. 

••search  la  underway  at  Georgia  Tech  to  Increase  the  accuracy  and 
computational  efficiency  of  the  spherical  surface  near -field  measurement 
technique. 
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X.  Introduction 


Alia  supplement  to  tba  annual  report  consists  of  tbs  following  printed 
table  of  contents  and  a  set  of  Microfiche  containing  all  papers  and  theses 
produced  with  JSEP  support  and  published  during  the  period  April  1,  1983 
through  March  31,  1984. 

This  for*  of  reporting  is  Modelled  after  that  introduced  by  the  Stanford 
Electronics  Laboratories  for  the  sane  purpose.  The  result  is  a  con  pact  pre¬ 
sentation  of  a  large  quantity  of  lnfornation  which  can  be  produced  Much  aore 
economically  than  printing.  Ot  the  other  hand,  it  is  realised  that  Microfiche 
is  less  convenient  than  a  printed  docusent.  Therefore,  those  who  are 
Interested  in  particular  reprints  nay  contact  R.W.  Sr*-.-;*,  to  request  a  xerox 
copy  of  any  of  the  listed  papers. 

II.  List  of  Be  pc  ints 

The  reprints  are  organized  by  work  unit  as  in  the  c cabined  Annual/Pinal 
Report  on  this  contract.  Numbers  in  parenthesis  indicate  reference  to  flche 
number  and  page.  The  page  numbers  are  coded  to  the  work  unit  nunbers.  Mote 
that  fiche  17  contains  this  printed  index. 

2.1  TNO-OIMBBXONU.  SIGNAL  PROCESS  HI  G  AMD  STORAGE 

M0I1  Constrained  iterative  Signal  Restoration  Algorithms 
B.M.  Meraereau  and  R.W.  Schafer 

A.G.  Katsaqgelos  and  R.W.  Schafer,  'Iterative  Deconvolution  Using 
Several  Different  Distorted  Versions  of  an  Unknown  Signal,*  Proc.  1983 
Int.  Conf«  on  Acoustics,  Speech,  and  Signal  Processing,  Boston,  pp. 
659-462,  April  1983.  (Fiche  II,  pp.  1-1  to  1-4.) 

M.H.  Hayes  and  R.w.  Schafer,  *0n  the  Bandlinited  Extrapolation  of 
Discrete  Signals,*  Proc.  1983  Int.  Conf.  on  Acoustics,  Speech,  and 
Signal  Processing,  Boston,  pp.  1450-1453,  April  1983.'  (Fiche  #1,  pp. 
1-5  to  1-8.) 


RO|2  Spectrum  Analysis  and  Parametric  Modelling 
R.W.  Schafer  and  R.M.  Meraereau 

R.M.  Mersercau,  E.W.  Brown,  and  A.  Guessoua,  "Row-Column  Algorithms 
for  the  Evaluation  of  Multidimensional  DFTa  on  Arbitrary  Periodic 
Sampling  Lattices,"  Proc.  IEEE  Int.  Conf.  on  Acoustics,  Speech,  and 
8iqnal  Processing,  pp.  1264-1267,  Apr.  1983.  (Fiche  #1,  pp.  2-1  to 
2-4.) 


R.M.  Mersereau,  "Dlnensionallty  Changing  Transformation  with  Mon- 
Rectangular  Sampling  Strategies,"  in  Transformations  in  Optica, 
(Rhodes,  Saleh,  Fienup,  eds.)  SPIE  Bellingham,  1983  (invited).  (Fiche 
II,  pp.  2-5  to  2-9.) 

A.  Guessoum,  "Fast  Algorithms  for  the  Multidimensional  Discrete 
Fourier  Transform,"  Ph.D.  Thesis,  Georgia  Institute  of  Technology, 
March  1984.  (Fiche  II,  pp.  2-10  to  2-90  and  Fiche  12  pp.  2-91  to 
2-170.) 


I.J.  Li*,  "Generalisation  of  One-Disenaional  Algor  ithaa  for  tha 
evaluation  of  Multidimensional  Circular  Convolutions  and  DfTm,"  M.S. 
Ifceals,  Georgia  Institute  of  Technology,  December  1983.  (rich*  *2, 
pp.  2-171  to  2-188  and  riche  |3,  pp.  2-189  to  2-284.) 

».A.  Naragoa,  R.M.  Meraereau,  and  R.w.  Schafer  •  "Two-Dimensional 
Linear  Predictive  Analysis  of  Arbitrarily  Shaped  Regions,"  Proc.  IEEE 
Xnt.  Conf.  on  Acouatlca,  Speech,  and  Signal  Processing,  pp,  104-107, 
Apr.  1983.  (riche  *4,  pp.  2-28S  to  2-288.) 

Signal  Reconstruction  Pro*  Partial  Idas*  and  Magnitude  Information 

K.B.  Bayes 

P.L.  Van  Hove,  M.H.  Hayes,  J.S.  Lis,  and  A.V.  Oppenheia,  "Signal 
Reconstruction  fron  Signed  Fourier  Trans  for*  Magnitude,"  IEEE  Trans. 
Acoust.,  Speech,  and  Signal  Processing,  ASSP-31,  pp.  1286-1293,  Oct. 
1983.  (Piche  14,  pp.  3-1  to  3-8.) 

M.H.  Hayes  and  T.P.  Quatierl,  "Recursive  Phase  Retrieval  Using 
Boundary  Conditions,"  J.  Opt.  Soc.  Aw.,  Vol.  73,  pp.  1427-1433,  Nov. 
1983.  (riche  84,  pp.  3-9  to  3-15.) 

M.H.  Hayes,  "The  Representation  of  Signals  in  Terms  of  Spectral 
Amplitude,"  Proc.  1983  Int,  Conf.  on  Acoust.,  Speech,  and  Signal 
Processing .  pp.  1446-1449,  April  1983.  (riche  44,  pp.  3-16  to  3-19.) 

Multiprocessor  Architectures  for  Digital  Signal  Processing 
T.P.  Barnwell,  III 

"Optical  Implementation  of  Plow  Graphs  on  Synchronous  Multi¬ 
processors,"  T.P.  Barnwell,  III,  and  D.A.  Schwarts,  Proceedings  of 
Aailomar  Conference  on  Circuits  and  Systeaa,  November  1983.  (riche 
14,  pp.  4-1  to  4-7.) 

Two-Di*en*ional  Optical  Storage  and  Processing 
T.K.  Gaylord 

Mohara*,  M.  G.  and  Gaylord,  T.  K. ,  "Rigorous  Coupled -Wave  Analysis  of 
Grating  Diffraction  —  E  Mode  Polarisation  and  Losses,"  Journal  of  the 
Optical  Society  of  Awerica,  vol.  73,  pp.  451-455,  April  1983.  (Piche 
#4,  pp.  5-1  to  5-5.) 

Mohara*,  M.  G.  and  Gaylord,  T.  K. ,  "Three-Dimensional  Vector  Coupled- 
Wave  Analysis  of  Planar-Grating  Diffraction,"  Journal  of  the  Optical 
Society  of  America,  vol.  73,  pp.  1105-1112,  September  1983.  (Piche 
#4,  pp  5-6  to  5-13.) 


Baird,  W.  E.,  Mohara*,  M.  G.,  and  Gaylord,  T.  K. ,  "Diffraction 
Characteristics  of  Planar  Absorption  Gratings,"  Applied  Physics  B, 
vol.  32,  pp.  15-20,  September  1983.  (riche  14,  pp.  5-14  to  5-19.) 

Mohara*,  M.  G.,  Gaylord,  T,  K. ,  Sincerbox,  G.  T.,  Werllch,  H.  and 
Tung,  B.,  "Diffraction  Characteristics  of  Surface-Relief  Dielectric 
Gratings,"  (Abstract)  Journal  of  the  Optical  Society  of  America,  vol. 
73,  pg.  1941,  December  1983.  (fiche  #4,  pp.  5-20.i 


Moharam,  M.  6.  and  Gaylord ,  t.  X. ,  “Diffraction  of  Pin  it*  Beau  by  Di- 
•lactr ic  Grating* ,"  (Abstract)  Journal  of  tha  Optical  Society  of 
Jjgarica,  vol .  73,  pg .  1941,  December  1983.  (Picha  #4,  pp.  5-20.) 

Mirsalehi,  M.  H.,  Guaat,  C.  C. ,  and  Gaylord,  T.  K. ,  “Optical  Truth- 
Tabla  Look-Op  Processing  of  Digital  Data,*  (Abstract)  Journal  of  the 
Optical  Society  of  America.  vol.  73,  pg.  1951,  Deceaber  1983.  (Piche 
14,  pp.  5-21.) 

Baird,  W.  B. ,  Gaylord,  T.  X.,  and  Mo  bars*,  h.  G.,  ‘Diffraction 
Efficiencies  of  Transmission  Absorption  Gratings,*  (Abstract)  Journal 
of  the  Optical  Society  of  America.  vol.  73,  pg.  1889,  Deceaber  1983. 
(Piche  44,  pp.  5-22.) 

Mirsalehi,  M.  M. ,  Guest,  C.  C  ,  and  Gaylord,  T.  X.,  'Residue  Number 
system  Holographic  Truth-Table  Look-Up  Processing:  Detector  Threshold 
Setting  and  Probability  of  Error  Due  to  Aaplitude  and  Phase 
Variations,*  Applied  Optics,  vol.  22,  pp.  3583-3592,  November  15, 
1983.  (Piche  14,  pp.  5-23  to  5-32.) 

Guest,  C.C.,  ‘Holographic  Optical  Digital  Parallel  Processing,*  Ph.D. 
Thesis,  Georgia  Institute  of  Technology,  November  1983.  (Piche  84, 
pp.  5-33  to  5-48  and  Piche  15,  pp.  5-69  to  5-166  and  Piche  16,  pp. 
5-167  to  5-184.) 

Hybrid  Optical/Digital  Signal  Processing 
W.T.  Rhodes 

J.N.  Halt  and  W.T.  Rhodes,  "Dependent  and  Independent  Constraints  for 
a  Multiple  Objective  Iterative  Algorithm,*  in  Signal  Recovery  and 
Synthesis  with  Incomplete  Informetion  and  Partial  Constraints 
(Technical  Digest)  (Optical  Society  of  America,  1983),  pp.  THA14-1 
through  THAI 4-4.  (Piche  «6,  pp.  6-1  to  6-4.) 

W.T.  Rhodes,  A.  Tarasevich,  and  N.  Eepkin,  ’Complex  Covariance  Matrix 
Inversion  with  a  Resonant  Electro-Optic  Processor,"  in  Two-Dimensional 
Image  and  Signal  Processing.  G.  Morris,  ed.  (Proc.  SPIE,  Vol.  388, 
1983),  pp.  197-204.  (Piche  *6,  pp.  6-5  to  6-12.) 


W.T.  Rhodes  and  M.  Koizumi,  "Image  Enhancement  by  Partially  Coherent 
Imaging,"  in  Proceedings  of  the  10th  International  Optical  Computing 
Conference  (IEEE  Computer  Society,  1983,  IEEE  Order  No.  83CH1880-4), 
pp.  32-35.  (Piche  46,  pp.  6-13  to  6-16.) 

W.T.  Rhodes,  "Hybrid  Time-  and  Space- In  teg  ret  ion  Method  for  Computer 
Holography,*  in  International  Conference  on  Computer-Generated  Holo¬ 
graphy,  S.  Lee,  ed.  (Proc.  SPIE,  Vol.  437,  1983),  pp.  xx-xx.  (Piche 
•  6,  pp.  6-17  to  6-22.) 

W.T.  Rhodes,  "Acousto-Optic  Algebraic  Processors,”  in  Real-Time  Signal 
Processing  VI.  X.  Bromley,  ed.  (Proc.  SPIE,  Vol.  431,  1983),  pp.  xx- 
xx.  (Piche  16,  pp.  6-23  to  6-33.) 

H.J.  Caulfield,  J.A.  Neff,  and  W.T.  Rhodes,  "Optical  Computing:  Hie 
Coming  Revolution  in  Optical  Signal  Processing,"  Laser  Focus/Electro- 
(fetics  Magazine,  November  1983,  pp.  100-110  (invitee.).  (Piche  46,  pp. 
6-34  to  6-42.) 
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6.S.  Smith  and  UK,  An,  "Loop  Antennas  foe  Directive  Transmission  into 
«  Mater ial  Half  Space,*  Radio  Science,  sol.  tl,  no.  5,  pp.  884-874, 
Sept. -Oct.  1983.  (Piche  #7,  pp.  7-1  to  7-11.) 

■.X.  Baa  sen  and  G.S.  Smith,  ’Electric  Field  Probes  -  A  Review,” 
(Invited  Paper),  IBEB  Trans.  Antennas  end  Propagation;  vol.  AP-31,  no. 
5,  pp.  71*. -71 8,  Sept.  1983.  (Piche  17,  pp.  7-12  to  7-20.) 

C.S.  Smith,  ’Directive  Properties  of  Antennas  for  Transmission  into  a 
Material  Half  Space,”  I  EES  Trans.  Antennas  and  Propagation,  vol.  AP- 
32,  no.  3,  pp.  232-246,  March  1984.  (Also  presented  at  the  1983  IEEE 
Antennas  and  Propagation  Society  International  Symposium  and  National 
Radio  Science  Meeting  (ORSI),  Houston,  TX,  pg.  7,  Kay  1983.)  (Piche 
#7,  pp.  7-21  to  7-35.) 

G.S.  Smith,  'Limitations  on  the  Sise  of  Miniature  Electric  Field 
Probes,”  IEEE  Trans.  Microwave  Theory  and  Techniwas,  volume  *•  XT-32, 
no.  6,  pp.  594-600,  June  1984.  (Piche  #7,  pp.  7-36  to  7-42.) 

G.S.  Smith,  ”Loop  Antennas,”  in  Antenna  Engineering  Handbook .  (R.C. 
Johnson  and  H.  Jasik,  Eds.,  New  York:  McGraw-Hill,  pp.  5-1  to  5-24, 
1984.  (Fiche  17,  pp.  7-43  to  7-87.) 


Automated  Radiation  Measurements 

Transformations 

K.B.  -toy 


and  Far -Field 


V.V.  Tory,  E.8.  Joy,  and  K.M.  Leach,  Jr.,  "Current  Antenna  Near-Field 
Measurement  Research  at  the  Georgia  Institute  of  Technology," 
Proceedings  of  the  13th  European  Microwave  Conference,  Nurnberg,  West 
Germany,  September  5-8,  1983,  pp.  8-23,  8-28.  Piche  47,  pp.  8-1  to 
8-6.) 


E.B.  Joy,  'Spherical  Surface  Near-Field  Measurements,”  Proceedings  of 
Antenna  Measurement  Techniques  Association 


Annapolis,  MD,  September  27-29,  1983,  pp.  23-1,  23-8.  (Fiche  17,  pp. 
8-7  to  8-12.) 


The  last  five  pages  of  Fiche  47  contain  the  above  list  of 
publications. 
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